POLYPEPTIDE-POLYMER CONJUGATES HAVING ADDED AND/OR REMOVED ATTACHMENT GROUPS

FIELD OF THE INVENTION 

The present invention relates to polypeptide-polymer conjugates
having one or more attachment groups added and/or removed for
coupling polymeric molecules on the surface of the 3D structure of
the polypeptide, a method for preparing polypeptide-polymer
conjugates of the invention, the use of said conjugates for
reducing immunogenicity and allergenicity, and compositions
comprising said conjugates.

BACKGROUND OF THE INVENTION

The use of polypeptides, including enzymes, in the circulatory
system to obtain a particular physiological effect is well-known
in the medical arts. Further, within the arts of industrial
applications, such as laundry washing, textile bleaching, personal
care, contact lens cleaning, and food and feed preparation,
enzymes are used as a functional ingredient. One of the important
differences between pharmaceutical and industrial applications is
that for the latter type of application (i.e. industrial
applications) the polypeptides (often enzymes) are not intended to
enter into the circulatory system of the body.

Certain polypeptides and enzymes have an unsatisfactory stability
and may under certain circumstances, dependent on the way of
challenge, cause an immune response, typically an IgG and/or IgE
response.

It is today generally recognized that the stability of
polypeptides is improved and the immune response is reduced when
polypeptides, such as enzymes, are coupled to polymeric molecules.
It is believed that the reduced immune response is a result of the
coupled polymeric molecules shielding the epitope(s) on the
surface of the polypeptide that are responsible for the immune
response leading to antibody formation.

Techniques for conjugating polymeric molecules to polypeptides are
well-known in the art.

One of the first commercially suitable techniques was described
back in the early 1970s and disclosed in e.g. US patent no.
4,179,337. Said patent concerns non-immunogenic polypeptides, such
as enzymes and peptide hormones coupled to polyethylene glycol
(PEG) or polypropylene glycol (PPG). At least 15% of the
polypeptides' physiological activity is maintained.

GB patent no. 1,183,257 (Crook et al.) describes chemistry for
conjugation of enzymes to polysaccharides via a triazine ring.

Further, techniques for maintaining the enzymatic activity of
enzyme-polymer conjugates are also known in the art.

WO 93/15189 (Veronese et al.) concerns a method for maintaining
the activity in polyethylene glycol-modified proteolytic enzymes
by linking the proteolytic enzyme to a macromolecularized
inhibitor. The conjugates are intended for medical applications.

It has been found that the attachment of polymeric molecules to a
polypeptide often has the effect of reducing the activity of the
polypeptide by interfering with the interaction between the
polypeptide and its substrate. EP 183 503 (Beecham Group PLC)
discloses a development of the above concept by providing
conjugates comprising pharmaceutically useful proteins linked to
at least one water-soluble polymer by means of a reversible
linking group.

EP 471,125 (Kanebo) discloses skin care products comprising a
parent protease (Bacillus protease with the trade name Esperase®)
coupled to polysaccharides through a triazine ring to improve the
thermal and preservation stability. The coupling technique used is
also described in the above mentioned GB patent no. 1,183,257
(Crook et al.).

JP 3083908 describes a skin cosmetic material which contains a
transglutaminase from guinea pig liver modified with one or more
water-soluble substances such as PEG, starch, cellulose etc. The
modification is performed by activating the polymeric molecules
and coupling them to the enzyme. The composition is stated to be
mild to the skin.

However, it is not always possible to readily couple polymeric
molecules to polypeptides and enzymes. Further, there is still a
need for polypeptide-polymer conjugates with an even further
reduced immunogenicity and/or allergenicity.

SUMMARY OF THE INVENTION 

It is the object of the present invention to provide improved
polypeptide-polymer conjugates suitable for industrial and
pharmaceutical applications.

The term "improved polypeptide-polymer conjugates" means in the
context of the present invention conjugates having a reduced
immune response in humans and animals and/or an improved
stability. As will be described further below, the immune response
is dependent on the way of challenge.

The present inventors have found that polypeptides, such as
enzymes, may be made less immunogenic and/or allergenic by adding
and/or removing one or more attachment groups on the surface of
the parent polypeptide to be coupled to polymeric molecules.

When introducing pharmaceutical polypeptides directly into the
circulatory system (i.e. bloodstream) the potential risk is an
immunogenic response in the form of mainly IgG, IgA and/or IgM
antibodies. In contrast hereto, industrial polypeptides, such as
enzymes used as a functional ingredient in e.g. detergents, are
not intended to enter the circulatory system. The potential risk
in connection with industrial polypeptides is inhalation causing
an allergenic response in the form of mainly IgE antibody
formation.

Therefore, in connection with industrial polypeptides the
potential risk is respiratory allergenicity caused by inhalation,
intratracheal and intranasal presentation of polypeptides.

The main potential risk of pharmaceutical polypeptides is
immunogenicity caused by intradermal, intravenous or subcutaneous
presentation of the polypeptide.

It is to be understood that reducing the "immunogenicity" and
reducing the "respiratory allergenicity" are two very different
problems based on different routes of exposure and on two very
different immunological mechanisms:

The term "immunogenicity" used in connection with the present
invention may be referred to as allergic contact dermatitis in a
clinical setting and is a cell mediated delayed immune response to
chemicals that contact and penetrate the skin. This cell mediated
reaction is also termed delayed contact hypersensitivity (type IV
reaction according to the Gell and Coombs classification of immune
mechanisms in tissue damage).

The term "allergenicity" or "respiratory allergenicity" refers to
an immediate anaphylactic reaction (type I antibody-mediated
reaction according to Gell and Coombs) following inhalation of
e.g. polypeptides.

According to the present invention it is possible to provide
polypeptides with a reduced immune response and/or improved
stability, which have a substantially retained residual activity.

The allergic and the immunogenic response are, at least in the
context of the present invention, referred to by the single term
"immune response".

In the first aspect the invention relates to a polypeptide-polymer
conjugate having

a) one or more additional polymeric molecules coupled to the
polypeptide having been modified in a manner to increase the
number of attachment groups on the surface of the polypeptide in
comparison to the number of attachment groups available on the
corresponding parent polypeptide, and/or

b) one or more fewer polymeric molecules coupled to the
polypeptide having been modified in a manner to decrease the
number of attachment groups at or close to the functional site(s)
of the polypeptide in comparison to the number of attachment
groups available on the corresponding parent polypeptide.

The term "parent polypeptide" refers to the polypeptide to be
modified by coupling to polymeric molecules. The parent
polypeptide may be a naturally-occurring (or wild-type)
polypeptide or may be a variant thereof prepared by any suitable
means. For instance, the parent polypeptide may be a variant of a
naturally-occurring polypeptide which has been modified by
substitution, deletion or truncation of one or more amino acid
residues or by addition or insertion of one or more amino acid
residues to the amino acid sequence of a naturally-occurring
polypeptide.

A "suitable attachment group" means in the context of the present
invention any amino acid residue group on the surface of the
polypeptide capable of coupling to the polymeric molecule in
question.

Preferred attachment groups are amino groups of Lysine residues
and the N-terminal amino group. Polymeric molecules may also be
coupled to the carboxylic acid groups (-COOH) of amino acid
residues in the polypeptide chain located on the surface.
Carboxylic acid attachment groups may be the carboxylic acid group
of Aspartate or Glutamate and the C-terminal COOH-group.

A "functional site" means any amino acid residues and/or cofactors
which are known to be essential for the performance of the
polypeptide, such as catalytic activity, e.g. the catalytic triad
residues Histidine, Aspartate and Serine in Serine proteases, or
e.g. the heme group and the distal and proximal Histidines in a
peroxidase such as the Arthromyces ramosus peroxidase.

In the second aspect the invention relates to a method for
preparing improved polypeptide-polymer conjugates comprising the
steps of:

a) identifying amino acid residues located on the surface of the
3D structure of the parent polypeptide in question,

b) selecting target amino acid residues on the surface of said 3D
structure of said parent polypeptide to be mutated,

c) i) substituting or inserting one or more amino acid residues
selected in step b) with an amino acid residue having a
suitable attachment group, and/or

ii) substituting or deleting one or more amino acid residues
selected in step b) at or close to the functional site(s),

d) coupling polymeric molecules to the mutated polypeptide.

The invention also relates to the use of a conjugate of the
invention and the method of the invention for reducing the
immunogenicity of pharmaceuticals and reducing the allergenicity
of industrial products.

Finally the invention relates to compositions comprising a
conjugate of the invention and further ingredients used in
industrial products or pharmaceuticals.

BRIEF DESCRIPTION OF THE DRAWING

Figure 1 shows the anti-lipase serum antibody levels after 5
weekly immunizations with i) control, ii) unmodified lipase
variant, iii) lipase variant-SPEG. (X: log(serum dilution); Y:
Optical Density (490/620)).

DETAILED DESCRIPTION OF THE INVENTION 
It is the object of the present invention to provide improved
polypeptide-polymer conjugates suitable for industrial and 
pharmaceutical applications. 

Even though polypeptides used for pharmaceutical applications and
industrial applications can be quite different, the principle of
the present invention may be tailored to the specific type of
parent polypeptide (i.e. enzyme, peptide hormone etc.).

The inventors of the present invention have provided improved
polypeptide-polymer conjugates with a reduced immune response in
comparison to conjugates prepared from the corresponding parent
polypeptides.

The present inventors have found that polypeptides, such as
enzymes, may be made less immunogenic and/or less allergenic by
adding one or more attachment groups on the surface of the parent
polypeptide. In addition thereto the inventors have found that a
higher percentage of maintained residual functional activity may
be obtained by removing attachment groups at or close to the
functional site(s).

In the first aspect the invention relates to an improved
polypeptide-polymer conjugate having

a) one or more additional polymeric molecules coupled to the
polypeptide having been modified in a manner to increase the
number of attachment groups on the surface of the polypeptide in
comparison to the number of attachment groups available on the
corresponding parent polypeptide, and/or

b) one or more fewer polymeric molecules coupled to the
polypeptide having been modified in a manner to decrease the
number of attachment groups at or close to the functional site(s)
of the polypeptide in comparison to the number of attachment
groups available on the corresponding parent polypeptide.

Whether the attachment groups should be added and/or removed
depends on the specific parent polypeptide.

a) Addition of Attachment groups
There may be a need for further attachment groups on the
polypeptide if only few attachment groups are available on the
surface of the parent polypeptide. The addition of one or more
attachment groups by substituting or inserting one or more amino
acid residues on the surface of the parent polypeptide increases
the number of polymeric molecules which may be attached in
comparison to the corresponding parent polypeptide. Conjugates
with an increased number of polymeric molecules attached thereto
are generally seen to have a reduced immune response in comparison
to the corresponding conjugates having fewer polymeric molecules
coupled thereto.

Any available amino acid residues on the surface of the
polypeptide, preferentially not being at or close to the
functional site(s), such as the active site(s) of enzymes, may in
principle be subject to substitution and/or insertion to provide
additional attachment groups.

As will be described further below the location of the
additional coupled polymeric molecules may be of importance for
the reduction of the immune response and the percentage of
maintained residual functional activity of the polypeptide itself.

A conjugate of the invention may typically have from 1 to 25,
preferentially 1 to 10 or more additional polymeric molecules
coupled to the surface of the polypeptide in comparison to the
number of polymeric molecules of a conjugate prepared on the basis
of the corresponding parent polypeptide.

However, the optimal number of attachment groups to be added
depends (at least partly) on the surface area (i.e. molecular
weight) of the parent polypeptide to be shielded by the coupled
polymeric molecules, and of course also on the number of
already available attachment groups on the parent polypeptide.

b) Removing Attachment groups 

In the case of enzymes or other polypeptides performing their
function by interaction with a substrate or the like, polymeric
molecules coupled to the polypeptide might impede the interaction
between the polypeptide and its substrate or the like if they are
coupled at or close to the functional site(s) (i.e. the active
site of enzymes). This will most probably cause reduced activity.

In the case of enzymes having one or more polymeric molecules
coupled at or close to the active site a substantial loss of
residual enzymatic activity can be expected. Therefore, according
to the invention conjugates may be constructed to maintain a
higher percentage of residual enzymatic activity in comparison to
corresponding conjugates prepared on the basis of the parent
enzyme in question. This may be done by substituting and/or
deleting attachment groups at or close to the active site, thereby
increasing the substrate affinity by improving the accessibility
of the catalytic cleft to the substrate.

An enzyme-polymer conjugate of the invention may typically have
from 1 to 25, preferably 1 to 10 fewer polymeric molecules coupled
at or close to the active site in comparison to the number of
polymeric molecules of a conjugate prepared on the basis of the
corresponding parent polypeptide.

As will be explained below, "at or close to" the functional
site(s) means that no polymeric molecule(s) should be coupled
within 5 Å, preferably 8 Å, especially 10 Å of the functional
site(s).

Removal of attachment groups at or close to the functional
site(s) of the polypeptide may advantageously be combined with
addition of attachment groups in other parts of the surface of the
polypeptide.

The total number of attachment groups may in this way be
unchanged, increased or decreased. However, the location(s) of the
attachment group(s) is (are) improved, as assessed by the
reduction of the immune response and/or the percentage of
maintained residual activity. Improved stability may also be
obtained this way.

The number of attachment groups

Generally the number of attachment groups should be balanced
against the molecular weight and/or surface area of the
polypeptide. The heavier the polypeptide is, the more polymeric
molecules should be coupled to the polypeptide to obtain
sufficient shielding of the epitope(s) responsible for antibody
formation.

Therefore, if the parent polypeptide molecule is relatively
light (e.g. 1 to 35 kDa) it may be advantageous to increase the
total number of coupled polymeric molecules (outside the
functional site(s)) to a total between 4 and 20.

If the parent polypeptide molecule is heavier, for instance 35
to 60 kDa, the number of coupled polymeric molecules (outside the
functional site(s)) may advantageously be increased to 7 to 40,
and so on.

The ratio between the molecular weight (Mw) of the polypeptide
in question and the number of coupled polymeric molecules
considered to be suitable by the inventors is listed below in
Table 1.

Table 1

Molecular weight of parent      Number of polymeric molecules
polypeptide (Mw), kDa           coupled to the polypeptide

1 to 35                         4-20
35 to 60                        7-40
60 to 80                        10-50
80 to 100                       15-70
more than 100                   more than 20

Reduced immune response vs. maintained residual enzymatic activity 
Especially for enzymes, in comparison to many other types of
polypeptides, there is a conflict between reducing the immune
response and maintaining a substantial residual enzymatic
activity, as the activity of enzymes is connected with interaction
between a substrate and the active site, often present as a cleft
in the enzyme structure.

Without being limited to any theory it is believed that the
loss of enzymatic activity of enzyme-polymer conjugates might be a
consequence of impeded access of the substrate to the active site,
in the form of spatial hindrance of the substrate's access to the
catalytic cleft by especially bulky and/or heavy polymeric
molecules. It might also, at least partly, be caused by
disadvantageous minor structural changes of the 3D structure of
the enzyme due to the stress made by the coupling of the polymeric
molecules.

Maintained residual activity 

A polypeptide-polymer conjugate of the invention has a
substantially maintained functional activity.

A "substantially" maintained functional activity is in the
context of the present invention defined as an activity which is
at least between 20% and 30%, preferably between 30% and 40%, more
preferably between 40% and 60%, better from 60% up to 80%, even
better from 80% up to about 100%, in comparison to the activity of
the conjugates prepared on the basis of corresponding parent
polypeptides.

In the case of polypeptide-polymer conjugates of the
invention where no polymeric molecules are coupled at or close to
the functional site(s) the residual activity may even be up to
100% or very close thereto. If attachment group(s) of the parent
polypeptide is (are) removed from the functional site the activity
might even be more than 100% in comparison to the modified (i.e.
polymer coupled) parent polypeptide conjugate.

Position of coupled polymeric molecules

To obtain an optimally reduced immune response (i.e.
immunogenic and allergenic response) the polymeric molecules
coupled to the surface of the polypeptide in question should be
located at a suitable distance from each other.

In a preferred embodiment of the invention the parent
polypeptide is modified in a manner whereby the polymeric
molecules are spread broadly over the surface of the polypeptide.
In the case where the polypeptide in question has enzymatic
activity it is preferred to have as few as possible, especially
none, polymeric molecules coupled at or close to the area of the
active site.

In the present context "spread broadly over the surface of the
polypeptide" means that the available attachment groups are
located so that the polymeric molecules shield different parts of
the surface, preferably the whole or close to the whole surface
area away from the functional site(s), to make sure that
epitope(s) are shielded and hereby not recognized by the immune
system or its antibodies.

The area of antibody-polypeptide interaction typically
covers an area of 500 Å², as described by Sheriff et al.
(1987), Proc. Natl. Acad. Sci. USA 84, p. 8075-8079. 500 Å²
corresponds to a rectangular box of 25 Å x 20 Å or a circular
region of radius 12.6 Å. Therefore, to prevent binding of
antibodies to the epitope(s) of the polypeptide in question it
is preferred to have a maximum distance between two attachment
groups of around 10 Å.
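The geometric equivalences quoted above for the 500 square ångström interaction area can be checked with a few lines of arithmetic. This is an illustrative sketch only; the variable names are chosen here and are not part of the original text.

```python
import math

# Antibody-polypeptide interaction area quoted in the text, in square angstroms
EPITOPE_AREA = 500.0

# Rectangular box of 25 angstroms by 20 angstroms
rect_area = 25.0 * 20.0

# Radius of a circle with the same area: area = pi * r**2
radius = math.sqrt(EPITOPE_AREA / math.pi)

print(rect_area)         # 500.0
print(round(radius, 1))  # 12.6
```

Both shapes give the same footprint, which is the reasoning behind keeping neighbouring attachment groups no more than roughly 10 ångström apart.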

Consequently, amino acid residues which are located in excess
of 10 Å away from already available attachment groups are
suitable target residues. If two or more attachment groups on the
polypeptide are located very close to each other it will in most
cases result in only one polymeric molecule being coupled.

To ensure a minimal loss of functional activity it is preferred
not to couple polymeric molecules at or close to the functional
site(s). Said distance depends at least partly on the bulkiness of
the polymeric molecules to be coupled, as impeded access to the
functional site caused by the bulky polymeric molecules is
undesired. Therefore, the more bulky the polymeric molecules are,
the longer the distance from the functional site to the coupled
polymeric molecules should be.

To maintain a substantial functional activity of the
polypeptide in question, attachment groups located within 5 Å,
preferably 8 Å, especially 10 Å from such functional site(s)
should be left uncoupled and may therefore advantageously be
removed or changed by mutation. Functional residues should
normally not be mutated/removed, even though they potentially
can be the target for coupling polymeric molecules. In said
case it may thus be advantageous to choose a coupling chemistry
involving different attachment groups.
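The two distance rules above (add attachment groups at positions more than about 10 ångström from existing ones and away from the functional site; remove attachment groups lying within the chosen cut-off of the functional site) can be sketched as a simple coordinate filter. This is a hedged illustration, not the inventors' procedure: the function names, the residue labels and the use of a single representative coordinate per residue are all assumptions introduced here, and a real analysis would start from the atoms of the 3D structure.

```python
import math
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float, float]


def min_distance(p: Point, others: Sequence[Point]) -> float:
    """Smallest Euclidean distance from p to any point in others."""
    return min((math.dist(p, q) for q in others), default=math.inf)


def addition_candidates(
    surface: Dict[str, Point],      # hypothetical: surface residue -> coordinate
    attachment: Dict[str, Point],   # existing attachment groups (e.g. Lysines)
    functional: Sequence[Point],    # coordinates of functional-site residues
    spacing: float = 10.0,          # angstroms from existing attachment groups
    exclusion: float = 10.0,        # angstroms from the functional site
) -> List[str]:
    """Surface residues far from existing attachment groups and from the site."""
    existing = list(attachment.values())
    return sorted(
        name
        for name, xyz in surface.items()
        if min_distance(xyz, existing) > spacing
        and min_distance(xyz, functional) > exclusion
    )


def removal_candidates(
    attachment: Dict[str, Point],
    functional: Sequence[Point],
    exclusion: float = 10.0,
) -> List[str]:
    """Attachment groups lying within the exclusion distance of the site."""
    return sorted(
        name
        for name, xyz in attachment.items()
        if min_distance(xyz, functional) <= exclusion
    )


if __name__ == "__main__":
    # Made-up coordinates purely for illustration
    surface = {"R10": (0.0, 0.0, 0.0), "N50": (30.0, 0.0, 0.0), "Q70": (12.0, 0.0, 0.0)}
    attachment = {"K20": (3.0, 0.0, 0.0), "K90": (28.0, 20.0, 0.0)}
    functional = [(0.0, 4.0, 0.0)]
    print(addition_candidates(surface, attachment, functional))
    print(removal_candidates(attachment, functional))
```

The `exclusion` parameter corresponds to the 5, 8 or 10 ångström cut-offs named in the text; as the text notes, bulkier polymeric molecules would argue for the larger value.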

Further, specific mutations providing attachment groups at or
close to known epitope(s) recognizable by the immune system, so
that polymeric molecules are coupled at or close to said
epitope(s), are also considered advantageous according to the
invention.

If the position of the epitope(s) is (are) unknown it is
advantageous to couple several or many polymeric molecules to the
polypeptide.

As also mentioned above it is preferred that said attachment 
groups are spread broadly over the surface. 

The attachment group 

Virtually all ionized groups, such as the amino groups of
Lysine residues, are located on the surface of the polypeptide
molecule (see for instance Thomas E. Creighton, (1993),
"Proteins", W.H. Freeman and Company, New York).

Therefore, the number of readily accessible attachment groups
(e.g. amino groups) on a modified or parent polypeptide generally
equals the number of Lysine residues in the primary structure of
the polypeptide plus the N-terminal amino group.
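The counting rule above (Lysine residues plus the N-terminal amino group) is easy to apply to a primary sequence. The sketch below assumes a single polypeptide chain with a free N-terminus written in one-letter amino acid codes; the function name and the example sequence are hypothetical.

```python
def count_amino_attachment_groups(sequence: str) -> int:
    """Number of Lysine residues plus one for the N-terminal amino group.

    Assumes one chain, one-letter codes, and that every Lysine is
    surface-accessible, which the text states holds only generally.
    """
    seq = sequence.strip().upper()
    if not seq:
        return 0
    return seq.count("K") + 1


if __name__ == "__main__":
    # Made-up ten-residue sequence containing two Lysines
    print(count_amino_attachment_groups("MKTAYIAKQR"))  # 3
```

A result like this can be compared against the ranges of Table 1 to judge whether attachment groups need to be added.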

The chemistry of coupling polymeric molecules to amino groups
is quite simple and well established in the art. Therefore, it is
preferred to add and/or remove Lysine residues (i.e. attachment
groups) to/from the parent polypeptide in question to obtain
improved conjugates with reduced immunogenicity and/or
allergenicity and/or improved stability and/or a high percentage
of maintained functional activity.

Polymeric molecules may also be coupled to the carboxylic
groups (-COOH) of amino acid residues on the surface of the
polypeptide. Therefore, if using carboxylic groups (including the
C-terminal group) as attachment groups, addition and/or removal of
Aspartate and Glutamate residues may also be suitable according
to the invention.

If using other attachment groups, such as -SH groups, they
may be added and/or removed analogously.

Substitution of the amino acid residues is preferred over 
insertion, as the impact on the 3D structure of the polypeptide 
normally will be less pronounced. 

Preferred substitutions are conservative substitutions. In the
case of increasing the number of attachment groups the
substitution may advantageously be performed at a location at
a distance of at least 5 Å, preferably 8 Å, especially 10 Å from
the functional site(s) (active site for enzymes).

An example of a suitable conservative substitution to obtain
an additional amino attachment group is an Arginine to Lysine
substitution. Examples of conservative substitutions to obtain
additional carboxylic attachment groups are Asparagine to
Aspartate/Glutamate or Glutamine to Aspartate/Glutamate
substitutions. To remove attachment groups a Lysine residue may be
substituted with an Arginine and so on.
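The conservative substitutions named above can be expressed as a small table of allowed residue exchanges applied to a sequence. This is a sketch under stated assumptions: one-letter amino acid codes, 1-based positions, and a hypothetical helper `apply_substitution`; it covers only the exchanges listed in the text and says nothing about where on the 3D structure a position lies.

```python
# Allowed (old residue, new residue) pairs, one-letter codes
CONSERVATIVE = {
    ("R", "K"),              # Arginine -> Lysine: adds an amino attachment group
    ("N", "D"), ("N", "E"),  # Asparagine -> Aspartate/Glutamate: adds a carboxylic group
    ("Q", "D"), ("Q", "E"),  # Glutamine -> Aspartate/Glutamate: adds a carboxylic group
    ("K", "R"),              # Lysine -> Arginine: removes an amino attachment group
}


def apply_substitution(sequence: str, position: int, new_residue: str) -> str:
    """Return the sequence with the residue at a 1-based position replaced.

    Raises ValueError if the position is out of range or the exchange is
    not one of the conservative substitutions listed in CONSERVATIVE.
    """
    if not 1 <= position <= len(sequence):
        raise ValueError("position outside the sequence")
    old_residue = sequence[position - 1]
    if (old_residue, new_residue) not in CONSERVATIVE:
        raise ValueError(f"{old_residue}{position}{new_residue} is not a listed substitution")
    return sequence[: position - 1] + new_residue + sequence[position:]


if __name__ == "__main__":
    parent = "MKTRYN"  # made-up sequence
    print(apply_substitution(parent, 4, "K"))  # MKTKYN (adds a Lysine)
    print(apply_substitution(parent, 2, "R"))  # MRTRYN (removes a Lysine)
```

Restricting the helper to the listed pairs mirrors the stated preference for conservative substitutions over insertions, which the text expects to disturb the 3D structure less.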



The parent polypeptide 
In the context of the present invention the term "polypeptides"
includes proteins, peptides and/or enzymes for pharmaceutical or
industrial applications. Typically the polypeptides in question
have a molecular weight in the range of about 1 to 100 kDa,
often between 15 kDa and 100 kDa.

Pharmaceutical polypeptides 

The term "pharmaceutical polypeptides" is defined as
polypeptides, including peptides, such as peptide hormones,
proteins and/or enzymes, being physiologically active when
introduced into the circulatory system of the body of humans
and/or animals.

Pharmaceutical polypeptides are potentially immunogenic as they 
are introduced into the circulatory system. 

Examples of "pharmaceutical polypeptides" contemplated
according to the invention include insulin, ACTH, glucagon,
somatostatin, somatotropin, thymosin, parathyroid hormone,
pigmentary hormones, somatomedin, erythropoietin, luteinizing
hormone, chorionic gonadotropin, hypothalamic releasing factors,
antidiuretic hormones, thyroid stimulating hormone, relaxin,
interferon, thrombopoietin (TPO) and prolactin.

Industrial polypeptides 

Polypeptides used for industrial applications often have an
enzymatic activity. Industrial polypeptides (e.g. enzymes) are (in
contrast to pharmaceutical polypeptides) not intended to be
introduced into the circulatory system of the body.

It is not very likely that industrial polypeptides, such as
enzymes used as ingredients in industrial compositions and/or
products, such as detergents and personal care products, including
cosmetics, come into direct contact with the circulatory system of
the body of humans or animals, as such enzymes (or products
comprising such enzymes) are not injected (or the like) into the
bloodstream.

Therefore, in the case of industrial polypeptides the
potential risk is respiratory allergy (i.e. IgE response) as a
consequence of inhalation of polypeptides through the respiratory
passage.

In the context of the present invention "industrial
polypeptides" are defined as polypeptides, including peptides,
proteins and/or enzymes, which are not intended to be introduced
into the circulatory system of the body of humans and/or animals.

Examples of such polypeptides are polypeptides, especially
enzymes, used in products such as detergents, household article
products, agrochemicals, personal care products, such as skin care
products, including cosmetics and toiletries, oral and dermal
pharmaceuticals, compositions used for processing textiles,
compositions for hard surface cleaning, and compositions used for
manufacturing food and feed etc.

Enzymatic activity 

Pharmaceutical or industrial polypeptides exhibiting enzymatic
activity will often belong to one of the following groups of
enzymes: Oxidoreductases (E.C. 1, "Enzyme Nomenclature", (1992),
Academic Press, Inc.), such as laccase and Superoxide dismutase
(SOD); Transferases (E.C. 2), such as transglutaminases (TGases);
Hydrolases (E.C. 3), including proteases, especially subtilisins,
and lipolytic enzymes; Isomerases (E.C. 5), such as Protein
disulfide Isomerases (PDI).

Hydrolases 

Proteolytic enzymes 

Contemplated proteolytic enzymes include proteases selected
from the group of Aspartic proteases, such as pepsins, Cysteine
proteases, such as Papain, Serine proteases, such as subtilisins,
or metallo proteases, such as Neutrase®.

Specific examples of parent proteases include PD498 (WO
93/24623 and SEQ ID NO. 2), Savinase® (von der Osten et al.,
(1993), Journal of Biotechnology, 28, p. 55+, SEQ ID NO 3),
Proteinase K (Gunkel et al., (1989), Eur. J. Biochem, 179, p.
185-194), Proteinase R (Samal et al., (1990), Mol. Microbiol, 4,
p. 1789-1792), Proteinase T (Samal et al., (1989), Gene, 85, p.
329-333), Subtilisin DY (Betzel et al. (1993), Arch. Biophys, 302,
no. 2, p. 499-502), Lion Y (JP 04197182-A), Rennilase® (available
from Novo Nordisk A/S), JA16 (WO 92/17576), Alcalase® (a natural
subtilisin Carlsberg variant) (von der Osten et al., (1993),
Journal of Biotechnology, 28, p. 55+).
Lipolytic enzymes

Contemplated lipolytic enzymes include Humicola lanuginosa
lipases, e.g. the one described in EP 258 068 and EP 305 216 (See
SEQ ID NO 6 below), Humicola insolens, a Rhizomucor miehei lipase,
e.g. as described in EP 238 023, Absidia sp. lipolytic enzymes (WO
96/13578), a Candida lipase, such as a C. antarctica lipase, e.g.
the C. antarctica lipase A or B described in EP 214 761, a
Pseudomonas lipase such as a P. alcaligenes and P.
pseudoalcaligenes lipase, e.g. as described in EP 218 272, a P.
cepacia lipase, e.g. as described in EP 331 376, a Pseudomonas sp.
lipase as disclosed in WO 95/14783, a Bacillus lipase, e.g. a B.
subtilis lipase (Dartois et al., (1993) Biochemica et Biophysica
acta 1131, 253-260), a B. stearothermophilus lipase (JP 64/744992)
and a B. pumilus lipase (WO 91/16422). Other types of lipolytic
enzymes include cutinases, e.g. derived from Pseudomonas mendocina
as described in WO 88/09367, or a cutinase derived from Fusarium
solani pisi (e.g. described in WO 90/09446).

Oxidoreductases

Laccases 

Contemplated laccases include Polyporus pinsitus laccase (WO
96/00290), Myceliophthora laccase (WO 95/33836), Scytalidium
laccase (WO 95/33837), and Pyricularia oryzae laccase (available
from Sigma).

Peroxidase 

Contemplated peroxidases include B. pumilus peroxidases (WO
91/05858), Myxococcaceae peroxidase (WO 95/11964), Coprinus
cinereus (WO 95/10602) and Arthromyces ramosus peroxidase
(Kunishima et al. (1994), J. Mol. Biol. 235, p. 331-344).

Transferases 

Transglutaminases 

Suitable transferases include any transglutaminases disclosed
in WO 96/06931 (Novo Nordisk A/S) and WO 96/22366 (Novo Nordisk
A/S).

Isomerases

Protein Disulfide Isomerase

Without being limited thereto, suitable protein disulfide
isomerases include PDIs described in WO 95/01425 (Novo Nordisk
A/S).

The polymeric molecule 

The polymeric molecules coupled to the polypeptide may be any
suitable polymeric molecule, including natural and synthetic
homopolymers, such as polyols (i.e. poly-OH), polyamines (i.e.
poly-NH2), polycarboxyl acids (i.e. poly-COOH), and further
heteropolymers, i.e. polymers comprising one or more different
coupling groups, e.g. a hydroxyl group and amine groups.

Examples of suitable polymeric molecules include polymeric
molecules selected from the group comprising polyalkylene oxides
(PAO), such as polyalkylene glycols (PAG), including polyethylene
glycols (PEG), methoxypolyethylene glycols (mPEG) and polypropylene
glycols, PEG-glycidyl ethers (Epox-PEG), PEG-oxycarbonylimidazole
(CDI-PEG), branched PEGs, poly-vinyl alcohol (PVA),
polycarboxylates, poly-(vinylpyrrolidone), poly-D,L-amino acids,
polyethylene-co-maleic acid anhydride, polystyrene-co-maleic acid
anhydride, dextrans including carboxymethyl-dextrans, heparin,
homologous albumin, celluloses, including methylcellulose,
carboxymethylcellulose, ethylcellulose, hydroxyethylcellulose,
carboxyethylcellulose and hydroxypropylcellulose, hydrolysates of
chitosan, starches such as hydroxyethyl-starches and
hydroxypropyl-starches, glycogen, agaroses and derivatives thereof,
guar gum, pullulan, inulin, xanthan gum, carrageenin, pectin,
alginic acid hydrolysates and bio-polymers.

Preferred polymeric molecules are non-toxic polymeric molecules
such as (m)polyethylene glycol ((m)PEG), which further requires a
relatively simple chemistry for its covalent coupling to
attachment groups on the enzyme's surface.

Generally, polyalkylene oxides (PAO), such as polyethylene
oxides, such as PEG and especially mPEG, are the preferred
polymeric molecules, as these polymeric molecules, in comparison
to polysaccharides such as dextran, pullulan and the like, have
few reactive groups capable of cross-linking.

Even though all of the above mentioned polymeric molecules may
be used according to the invention, the methoxypolyethylene glycols
(mPEG) may advantageously be used. This arises from the fact that
methoxypolyethylene glycols have only one reactive end capable of
conjugating with the enzyme. Consequently, the risk of
cross-linking is less pronounced. Further, it makes the product more
homogeneous and the reaction of the polymeric molecules with the
enzyme easier to control.

Preparation of enzyme variants

Enzyme variants to be conjugated may be constructed by any
suitable method. A number of methods are well established in
the art. For instance, enzyme variants according to the
invention may be generated using the same materials and methods
described in e.g. WO 89/06279 (Novo Nordisk A/S), EP 130,756
(Genentech), EP 479,870 (Novo Nordisk A/S), EP 214,435
(Henkel), WO 87/04461 (Amgen), WO 87/05050 (Genex), EP
application no. 87303761 (Genentech), EP 260,105 (Genencor), WO
88/06624 (Gist-Brocades NV), WO 88/07578 (Genentech), WO
88/08028 (Genex), WO 88/08033 (Amgen), WO 88/08164 (Genex),
Thomas et al. (1985) Nature, 318, 375-376; Thomas et al. (1987)
J. Mol. Biol., 193, 803-813; Russel and Fersht (1987) Nature
328, 496-500.

Generation of site directed mutations

Prior to mutagenesis the gene encoding the polypeptide of
interest must be cloned in a suitable vector. Methods for
generating mutations in specific sites are described below.

Once the polypeptide encoding gene has been cloned, the
desirable sites for mutation identified and the residues to
substitute for the original ones decided, these
mutations can be introduced using synthetic oligonucleotides.
These oligonucleotides contain nucleotide sequences flanking the
desired mutation sites; mutant nucleotides are inserted during
oligonucleotide synthesis. In a preferred method, site-directed
mutagenesis is carried out by the SOE-PCR mutagenesis technique
described by Kammann et al. (1989) Nucleic Acids Research 17(13),
5404, and by Sarkar, G. and Sommer, S.S. (1990) Biotechniques 8,
404-407.

Activation of polymers 

If the polymeric molecules to be conjugated with the
polypeptide in question are not active, they must be activated by
the use of a suitable technique. It is also contemplated according
to the invention to couple the polymeric molecules to the
polypeptide through a linker. Suitable linkers are well-known to
the skilled person.

Methods and chemistry for activation of polymeric molecules
as well as for conjugation of polypeptides are extensively
described in the literature. Commonly used methods for activation
of insoluble polymers include activation of functional groups with
cyanogen bromide, periodate, glutaraldehyde, biepoxides,
epichlorohydrin, divinylsulfone, carbodiimide, sulfonyl halides,
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein
immobilisation. Fundamental and applications", Marcel Dekker,
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and
Crosslinking", CRC Press, Boca Raton; G.T. Hermanson et al.,
(1993), "Immobilized Affinity Ligand Techniques", Academic Press,
N.Y.). Some of the methods concern activation of insoluble
polymers but are also applicable to activation of soluble polymers,
e.g. periodate, trichlorotriazine, sulfonylhalides,
divinylsulfone, carbodiimide etc. The functional groups being
amino, hydroxyl, thiol, carboxyl, aldehyde or sulfhydryl on the
polymer and the chosen attachment group on the protein must be
considered in choosing the activation and conjugation chemistry,
which normally consists of i) activation of polymer, ii)
conjugation, and iii) blocking of residual active groups.

In the following a number of suitable polymer activation
methods will be described briefly. However, it is to be understood
that other methods may also be used.

Coupling polymeric molecules to the free acid groups of
polypeptides may be performed with the aid of diimide and for example
amino-PEG or hydrazino-PEG (Pollak et al., (1976), J. Am. Chem.
Soc., 98, 289-291) or diazoacetate/amide (Wong et al., (1992),
"Chemistry of Protein Conjugation and Crosslinking", CRC Press).

Coupling polymeric molecules to hydroxy groups is generally
very difficult as it must be performed in water. Usually
hydrolysis predominates over reaction with hydroxyl groups.

Coupling polymeric molecules to free sulfhydryl groups can be
achieved with special groups like maleimido or the ortho-pyridyl
disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995),
Snow et al.) has a preference for sulfhydryl groups but is not as
selective as the others mentioned.

Accessible Arginine residues in the polypeptide chain may be
targeted by groups comprising two vicinal carbonyl groups.

Techniques involving coupling electrophilically activated
PEGs to the amino groups of Lysines may also be useful. Many of
the usual leaving groups for alcohols give rise to an amine
linkage. For instance, alkyl sulfonates, such as tresylates
(Nilsson et al., (1984), Methods in Enzymology vol. 104, Jacoby,
W. B., Ed., Academic Press: Orlando, p. 56-66; Nilsson et al.,
(1987), Methods in Enzymology vol. 135; Mosbach, K., Ed.; Academic
Press: Orlando, pp. 65-79; Scouten et al., (1987), Methods in
Enzymology vol. 135, Mosbach, K., Ed., Academic Press: Orlando,
1987; pp. 79-84; Crossland et al., (1971), J. Am. Chem. Soc. 1971,
93, pp. 4217-4219), mesylates (Harris, (1985), supra; Harris et
al., (1984), J. Polym. Sci. Polym. Chem. Ed. 22, pp. 341-352), aryl
sulfonates like tosylates, and para-nitrobenzene sulfonates can be
used.

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively
convert hydroxy groups in a number of polymers, e.g. PEG, into
good leaving groups (sulfonates) that, when reacted with
nucleophiles like amino groups in polypeptides, allow stable linkages
to be formed between polymer and polypeptide. In addition to high
conjugation yields, the reaction conditions are in general mild
(neutral or slightly alkaline pH, to avoid denaturation and little
or no disruption of activity), and satisfy the non-destructive
requirements to the polypeptide.

Tosylate is more reactive than the mesylate but also more
unstable, decomposing into PEG, dioxane, and sulfonic acid (Zalipsky,
(1995), Bioconjugate Chem., 6, 150-165). Epoxides may also be
used for creating amine bonds but are much less reactive than the
above mentioned groups.

Converting PEG into a chloroformate with phosgene gives rise
to carbamate linkages to Lysines. This theme can be played in many
variants, substituting the chlorine with N-hydroxy succinimide (US
patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol.
Appl. Biochem., 15, p. 100-114; Monfardini et al., (1995),
Bioconjugate Chem., 6, 62-69), with imidazole (Allen et al., (1991),
Carbohydr. Res., 213, pp. 309-319), with para-nitrophenol, DMAP (EP
632 082 A1, (1993), Looze, Y.) etc. The derivatives are usually
made by reacting the chloroformate with the desired leaving group.
All these groups give rise to carbamate linkages to the peptide.

Furthermore, isocyanates and isothiocyanates may be employed,
yielding ureas and thioureas, respectively.

Amides may be obtained from PEG acids using the same leaving
groups as mentioned above and cyclic imide thiones (US patent no.
5,349,001, (1994), Greenwald et al.). The reactivity of these
compounds is very high but may make the hydrolysis too fast.

PEG succinate made from reaction with succinic anhydride can
also be used. The ester group thereby introduced makes the conjugate
much more susceptible to hydrolysis (US patent no. 5,122,614,
(1992), Zalipsky). This group may be activated with N-hydroxy
succinimide.

Furthermore, a special linker can be introduced. The oldest
is cyanuric chloride (Abuchowski et al., (1977), J. Biol.
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24,
375-378).

Coupling of PEG to an aromatic amine followed by diazotation
yields a very reactive diazonium salt which can be reacted in situ
with a peptide. An amide linkage may also be obtained by reacting
an azlactone derivative of PEG (US patent no. 5,321,095, (1994),
Greenwald, R. B.), thus introducing an additional amide linkage.

As some peptides do not comprise many Lysines it may be
advantageous to attach more than one PEG to the same Lysine. This
can be done e.g. by the use of 1,3-diamino-2-propanol.

PEGs may also be attached to the amino groups of the enzyme
with carbamate linkages (WO 95/11924, Greenwald et al.). Lysine
residues may also be used as the backbone.

The coupling technique used in the examples is the
N-succinimidyl carbonate conjugation technique described in WO
90/13590 (Enzon).

Method for preparing improved conjugates 

It is also an object of the invention to provide a method for
preparing improved polypeptide-polymer conjugates comprising the
steps of:

a) identifying amino acid residues located on the surface of the
3D structure of the parent polypeptide in question,

b) selecting target amino acid residues on the surface of said 3D
structure of said parent polypeptide to be mutated,

c) i) substituting or inserting one or more amino acid residues
selected in step b) with an amino acid residue having a suitable
attachment group, and/or

ii) substituting or deleting one or more amino acid residues
selected in step b) at or close to the functional site(s),

d) coupling polymeric molecules to the mutated polypeptide.

Step a) Identifying amino acid residues located on the surface of
the parent polypeptide

3-dimensional structure (3D-structure)

To perform the method of the invention a 3-dimensional
structure of the parent polypeptide in question is required.
This structure may for example be an X-ray structure, an NMR
structure or a model-built structure. The Brookhaven Databank
is a source of X-ray and NMR structures.

A model-built structure may be produced by the person
skilled in the art if one or more 3D-structure(s) exist(s) of
homologous polypeptide(s) sharing at least 30% sequence
identity with the polypeptide in question. Several software
packages exist which may be employed to construct a model
structure. One example is the Homology 95.0 package from
Biosym.

Typical actions required for the construction of a model
structure are: alignment of homologous sequences for which
3D-structures exist, definition of Structurally Conserved Regions
(SCRs), assignment of coordinates to SCRs, search for
structural fragments/loops in structure databases to replace
Variable Regions, assignment of coordinates to these regions,
and structural refinement by energy minimization. Regions
containing large inserts (≥3 residues) relative to the known
3D-structures are known to be quite difficult to model, and
structural predictions must be considered with care.

Having obtained the 3D-structure of the polypeptide in
question, or a model of the structure based on homology to
known structures, this structure serves as an essential
prerequisite for the fulfillment of the method described below.
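
The 30% sequence identity level mentioned above for homology
modelling can be checked on a pairwise alignment. The following
is a minimal Python sketch, assuming the two sequences have
already been aligned to equal length with "-" as the gap
character. The function names, and the choice to count identity
over columns without gaps, are illustrative assumptions; the text
does not state how identity is to be counted.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # gapped columns are not counted (assumption)
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


def meets_modelling_threshold(aligned_a: str, aligned_b: str,
                              threshold: float = 30.0) -> bool:
    """True when identity reaches the 30% level cited in the text."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Other conventions, e.g. dividing by the length of the shorter
sequence, give different percentages, so the convention should be
fixed before comparing against the threshold.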

Step b) Selection of target amino acid residues for mutation

Target amino acid residues to be mutated are according to
the invention selected in order to obtain additional or fewer
attachment groups, such as free amino groups (-NH2) or free
carboxylic acid groups (-COOH), on the surface of the
polypeptide and/or to obtain a more complete and broadly spread
shielding of the epitope(s) on the surface of the polypeptide.

Conservative substitution

It is preferred to make conservative substitutions in the
polypeptide, as conservative substitutions ensure that the
impact of the mutation on the polypeptide structure is limited.

In the case of providing additional amino groups this may be
done by substitution of Arginine to Lysine, both residues being
positively charged, but only the Lysine having a free amino
group suitable as an attachment group.

In the case of providing additional carboxylic acid groups
the conservative substitution may for instance be an Asparagine
to Aspartic acid or Glutamine to Glutamic acid substitution.
These residues resemble each other in size and shape, except
for the carboxylic groups being present on the acidic
residues.
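
The conservative substitutions named above can be written as a
small lookup table. The following Python sketch is illustrative
only: the names CONSERVATIVE_ADD and conservative_additions are
assumptions, one-letter amino acid codes are used, and the
function lists every matching position in a sequence without
checking whether the residue lies on the surface or near the
functional site, which the method requires separately.

```python
# Conservative substitutions named in the text, keyed by the attachment
# group they introduce (one-letter amino acid codes).
CONSERVATIVE_ADD = {
    "amino": {"R": "K"},               # Arginine -> Lysine
    "carboxyl": {"N": "D", "Q": "E"},  # Asn -> Asp, Gln -> Glu
}


def conservative_additions(sequence: str, group: str) -> list[str]:
    """List candidate mutations (e.g. 'R2K') that would add `group`.

    Positions are 1-based. Surface exposure and distance to the
    functional site are NOT checked here.
    """
    table = CONSERVATIVE_ADD[group]
    return [
        f"{res}{pos}{table[res]}"
        for pos, res in enumerate(sequence.upper(), start=1)
        if res in table
    ]
```

For a hypothetical five-residue sequence "MRNQK", the amino table
proposes R2K and the carboxyl table proposes N3D and Q4E.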

In the case of providing fewer attachment groups, e.g. at or
close to the active site, a Lysine may be substituted with an
Arginine, and so on.

Which amino acids to substitute depends in principle on the
coupling chemistry to be applied.

Non-conservative substitution

The mutation may also be on target amino acid residues which
are less/non-conservative. Such mutation is suitable for
obtaining a more complete and broadly spread shielding of the
polypeptide surface than can be obtained by the conservative
substitutions.

The method of the invention is first described in general
terms, and subsequently using specific examples.

Note the use of the following terms:

Attachment_residue: residue(s) which can bind polymeric
molecules, e.g. Lysines (amino group) or Aspartic/Glutamic
acids (carboxylic groups). N- or C-terminal amino/carboxylic
groups are to be included where relevant.

Mutation_residue: residue(s) which is to be mutated, e.g.
Arginine or Asparagine/Glutamine.

Essential_catalytic_residues: residues which are known to be
essential for catalytic function, e.g. the catalytic triad in
Serine proteases.

Solvent_exposed_residues: These are defined as residues which
are at least 5% exposed according to the BIOSYM/INSIGHT
algorithm found in the module Homology 95.0. The sequence of
commands is as follows:

Homology=>ProStat=>Access_Surf=>Solv_Radius 1.4; Heavy atoms
only; Radii source VdW; Output: Fractional Area; Polarity
source: Default. The file filename_area.tab is produced. Note:
For this program to function properly all water molecules must
first be removed from the structure.
It looks for example like:

# PD498FINALMODEL
# residue    area
TRP_1        136.275711
SER_2         88.188095
PRO_3         15.458788
ASN_4         95.322319
ASP_5          4.903404
PRO_6         68.096909
TYR_7         93.333252
TYR_8         31.791576
SER_9         95.983139
... continued
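
A listing of this kind can be read programmatically. The
following Python sketch assumes two whitespace-separated columns
(residue label, area) and "#" comment lines, as in the excerpt
above; parse_area_table and residues_at_or_above are hypothetical
helper names. The excerpt does not make clear whether the area
column is absolute or fractional, so the cutoff is left to the
caller and the value 5.0 used below is purely illustrative.

```python
def parse_area_table(text: str) -> dict[str, float]:
    """Map residue labels such as 'TRP_1' to the numeric area column."""
    areas: dict[str, float] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and header/comment lines
        fields = line.split()
        if len(fields) != 2:
            continue  # ignore anything that is not a residue/area pair
        label, value = fields
        try:
            areas[label] = float(value)
        except ValueError:
            continue  # e.g. a trailing "... continued" line
    return areas


def residues_at_or_above(areas: dict[str, float], cutoff: float) -> list[str]:
    """Residue labels whose value is >= cutoff, in table order."""
    return [label for label, value in areas.items() if value >= cutoff]


EXAMPLE = """\
# PD498FINALMODEL
# residue area
TRP_1 136.275711
SER_2 88.188095
PRO_3 15.458788
ASN_4 95.322319
ASP_5 4.903404
... continued
"""

areas = parse_area_table(EXAMPLE)
selected = residues_at_or_above(areas, cutoff=5.0)  # illustrative cutoff
```

With this illustrative cutoff, ASP_5 is the only residue of the
five that is excluded.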

1. Identification of residues which are more than 10 Å away
from the closest attachment_residue, and which are located at
least 8 Å away from essential_catalytic_residues. This residue
subset is called REST, and is the primary region for
conservative mutation_residue to attachment_residue
substitutions.

2. Identification of residues which are located in a 0-5 Å
shell around subset REST, but at least 8 Å away from
essential_catalytic_residues. This residue subset is called
SUB5B. This is a secondary region for conservative
mutation_residue to attachment_residue substitutions, as a
ligand bound to an attachment_residue in SUB5B will extend into
the REST region and potentially prevent epitope recognition.

3. Identification of solvent_exposed mutation_residues in REST
and SUB5B as potential mutation sites for introduction of
attachment_residues.

4. Use BIOSYM/INSIGHT's Biopolymer module and replace residues
identified under action 3.

5. Repeat 1-2 above producing the subset RESTx. This subset
includes residues which are more than 10 Å away from the
nearest attachment_residue, and which are located at least 8 Å
away from essential_catalytic_residues.

6. Identify solvent_exposed_residues in RESTx. These are
potential sites for less/non-conservative mutations to
introduce attachment_residues.
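
Actions 1 to 3 above amount to distance filtering on the
3D-structure and can be sketched in Python as follows. This is
not the BIOSYM/INSIGHT procedure itself: it assumes each residue
is represented by a single coordinate (for instance its C-alpha
atom), whereas the text does not say how residue-to-residue
distances are measured. The names select_regions and
candidate_sites are hypothetical.

```python
import math

Coord = tuple[float, float, float]


def _min_dist(point: Coord, others: list[Coord]) -> float:
    """Smallest Euclidean distance from point to any of others."""
    return min((math.dist(point, o) for o in others), default=math.inf)


def select_regions(coords: dict[str, Coord],
                   attachment: set[str],
                   catalytic: set[str],
                   far_from_attachment: float = 10.0,
                   far_from_catalytic: float = 8.0,
                   shell: float = 5.0) -> tuple[set[str], set[str]]:
    """Return (REST, SUB5B) following actions 1 and 2 of the text."""
    att_xyz = [coords[r] for r in attachment]
    cat_xyz = [coords[r] for r in catalytic]

    # Residues at least 8 A away from the essential catalytic residues.
    allowed = {r for r, xyz in coords.items()
               if _min_dist(xyz, cat_xyz) >= far_from_catalytic}

    # Action 1: more than 10 A from the closest attachment residue.
    rest = {r for r in allowed
            if _min_dist(coords[r], att_xyz) > far_from_attachment}

    # Action 2: residues outside REST lying in a 0-5 A shell around it.
    rest_xyz = [coords[r] for r in rest]
    sub5b = {r for r in allowed - rest
             if _min_dist(coords[r], rest_xyz) <= shell}
    return rest, sub5b


def candidate_sites(region: set[str], residue_type: dict[str, str],
                    exposed: set[str], mutation_types: set[str]) -> set[str]:
    """Action 3: solvent-exposed mutation_residues inside a region."""
    return {r for r in region
            if r in exposed and residue_type[r] in mutation_types}
```

Action 5 corresponds to calling select_regions again with the
attachment set extended by the residues replaced under action 4.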

Step c) Substituting, inserting or deleting amino acid residues

The mutation(s) performed in step c) may be performed by
standard techniques well known in the art, such as site-directed
mutagenesis (see, e.g., Sambrook et al., (1989), Molecular
Cloning. A Laboratory Manual, Cold Spring Harbor, NY).

A general description of nucleotide substitution can be found
in e.g. Ford et al., 1991, Protein Expression and Purification 2,
p. 95-107.

Step d) Coupling polymeric molecules to the modified parent enzyme

Polypeptide-polymer conjugates of the invention may be
prepared by any coupling method known in the art including the
above mentioned techniques.


Addition of attachment groups

Specific examples of PD498 variant-SPEG conjugates

A specific example of a protease is the parent PD498 (WO
93/24623 and SEQ ID NO. 2). The parent PD498 has a molecular
weight of 29 kDa.

Lysine and Arginine residues are located as follows:

Distance from the
active site          Arginine    Lysine
0-5 Å                1           -
5-10 Å               -           -
10-15 Å              5           6
15-20 Å              2           3
20-25 Å              1           3
total                9           12

The inventors examined which parent PD498 sites on the surface
may be suitable for introducing additional attachment groups.

A. Suitable conservative Arginine to Lysine substitutions in
parent PD498 may be any of R51K, R62K, R121K, R169K, R250K, R28K,
R190K.

B. Suitable non-conservative substitutions in parent PD498 may
be any of P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K,
N65K, G87K, I88K, N209K, A211K, N216K, N217K, G218K, Y219K,
S220K, Y221K, G262K.

As there are no Lysine residues at or close to the active site
there is no need for removing any attachment group.
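
Substitution codes such as R51K (original residue, position, new
residue) can be parsed and applied mechanically. The Python
sketch below is illustrative: parse_mutation and apply_mutations
are hypothetical names, and it assumes that positions index
directly into the sequence string using 1-based numbering, which
may not match the numbering of a given SEQ ID listing.

```python
import re

_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(code: str) -> tuple[str, int, str]:
    """Split e.g. 'R51K' into ('R', 51, 'K')."""
    match = _MUTATION.match(code.strip().upper())
    if match is None:
        raise ValueError(f"not a substitution code: {code!r}")
    original, position, replacement = match.groups()
    return original, int(position), replacement


def apply_mutations(sequence: str, codes: list[str]) -> str:
    """Apply substitutions to a sequence using 1-based positions."""
    residues = list(sequence.upper())
    for code in codes:
        original, position, replacement = parse_mutation(code)
        if not 1 <= position <= len(residues):
            raise ValueError(f"{code}: position outside sequence")
        if residues[position - 1] != original:
            # Guard against applying a code to the wrong numbering scheme.
            raise ValueError(
                f"{code}: found {residues[position - 1]} at {position}")
        residues[position - 1] = replacement
    return "".join(residues)
```

Checking the original residue before substituting catches
numbering mismatches early; for a hypothetical sequence "MRTAY",
the codes R2K and Y5K yield "MKTAK".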

PD498 variant-SPEG conjugates may be prepared using any of the
above mentioned PD498 variants as the starting material by any
conjugation technique known in the art for coupling polymeric
molecules to amino groups on the enzyme. A specific example is
described below.

Removal of attachment groups

Specific examples of BPN' variant-SPEG conjugates

A specific example of a protease having an attachment group in
the active site is BPN', which has 11 attachment groups (plus an
N-terminal amino group). BPN' has a molecular weight of 28 kDa.

Lysine and Arginine residues are located as follows:



Distance from
the active site      Arginine    Lysine
0-5 Å                -           1
5-10 Å               -           -
10-15 Å              1           4
15-20 Å              1           4
20-25 Å              -           2
total                2           11



The Lysine residue located within 0-5 Å of the active site can
according to the invention advantageously be removed. Specifically
this may be done by a K94R substitution.

BPN' variant-SPEG conjugates may be prepared using the above
mentioned BPN' variant as the starting material by any conjugation
technique known in the art for coupling polymeric molecules to
amino groups on the enzyme.
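
Tables of Lysine and Arginine counts per distance shell, and the
identification of Lysines to remove near the active site, can be
produced from a list of residue distances. The Python sketch below
is illustrative only. The function names are hypothetical, the
distances in the usage note are invented for demonstration and are
not taken from any structure, and the shell boundaries (lower
bound inclusive, upper bound exclusive) are an assumption, since
the text does not say to which shell a boundary value belongs.

```python
from collections import Counter


def shell_label(distance: float, width: int = 5) -> str:
    """Label the shell containing distance, e.g. 7.2 -> '5-10'."""
    lower = int(distance // width) * width
    return f"{lower}-{lower + width}"


def tabulate_by_shell(distances: dict[str, float]) -> dict[str, Counter]:
    """Count residues per distance shell, grouped by residue type.

    Keys of `distances` are labels such as 'LYS_94'; values are
    distances in A from the active site.
    """
    table: dict[str, Counter] = {}
    for label, distance in distances.items():
        residue_type = label.split("_")[0]
        table.setdefault(shell_label(distance), Counter())[residue_type] += 1
    return table


def lysines_to_remove(distances: dict[str, float],
                      cutoff: float = 5.0) -> list[str]:
    """Lysines closer to the active site than cutoff (default 5 A)."""
    return sorted(label for label, d in distances.items()
                  if label.startswith("LYS_") and d < cutoff)
```

For example, with the invented distances {"LYS_94": 4.2,
"ARG_186": 12.0, "LYS_27": 13.5}, only LYS_94 falls in the 0-5
shell and is returned by lysines_to_remove.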

Addition and removal of attachment groups

Specific example of Savinase®-SPEG conjugates

As described in Example 2, parent Savinase® (von der Osten et
al., (1993), Journal of Biotechnology, 28, p. 55+ and SEQ ID NO.
3) may according to the invention have a number of amino
attachment groups added to the surface and an amino attachment
group removed close to the active site.

Any of the following substitutions in the parent Savinase®
are sites for mutagenesis: R10K, R19K, R45K, R145K, R170K,
R186K and R247K.

The substitution K94R is identified as a mutation suitable
for preventing attachment of polymers close to the active site.

Savinase® variant-SPEG conjugates may be prepared using any of
the above mentioned Savinase® variants as the starting material by
any conjugation technique known in the art for coupling polymeric
molecules to amino groups on the enzyme.

Addition of attachment groups

Specific examples of Humicola lanuginosa lipase variant-SPEG
conjugates

Specific examples of lipase variants with reduced
immunogenicity using the parent Humicola lanuginosa DSM 4109
lipase (see SEQ ID NO 6) as the backbone for substitutions are
listed below.

The parent unmodified Humicola lanuginosa lipase has 8
attachment groups including the N-terminal NH2 group and a
molecular weight of about 29 kDa.

A. Suitable conservative Arginine to Lysine substitutions in the
parent lipase may be any of R133K, R139K, R160K, R179K, R209K,
R118K and R125K.

Suitable non-conservative substitutions in the parent lipase
may be any of:
A18K, G31K, T32K, N33K, G38K, A40K, D48K, T50K, E56K, D57K, S58K,
G59K, V60K, G61K, D62K, T64K, L78K, N88K, G91K, N92K, L93K, S105K,
G106K, V120K, P136K, G225K, L227K, V228K, P229K, P250K, F262K.

Further suitable non-conservative substitutions in the Humicola
lanuginosa lipase include E87K or D254K.

Lipase variant-SPEG conjugates may be prepared using any of the
above mentioned lipase variants as the starting material by any
conjugation technique known in the art for coupling polymeric
molecules to amino groups on the enzyme. A specific example is
described below.

In Example 12 below it is shown that a conjugate of the
Humicola lanuginosa lipase variant with the E87K+D254K substitutions
coupled to S-PEG 15,000 has a reduced immunogenic response in Balb/C
mice in comparison to the corresponding parent unmodified enzyme.

Immunogenicity and Allergenicity

"Immunogenicity" is a wider term than "antigenicity" and
"allergenicity", and expresses the immune system's response to the
presence of foreign substances. Said foreign substances are called
immunogens, antigens and allergens depending on the type of immune
response they elicit.

An "immunogen" may be defined as a substance which, when
introduced into the circulatory system of animals and humans, is
capable of stimulating an immunologic response resulting in
formation of immunoglobulin.

The term "antigen" refers to substances which by themselves are
capable of generating antibodies when recognized as a non-self
molecule.

Further, an "allergen" may be defined as an antigen which may
give rise to allergic sensitization or an allergic response by IgE
antibodies (in humans, and molecules with comparable effects in
animals).

Assessment of immunogenicity

Assessment of the immunogenicity may be made by injecting
animals subcutaneously, so that the immunogen enters the
circulatory system, and comparing the response with the response
to the corresponding parent polypeptide.

The "circulatory system" of the body of humans and animals
means, in the context of the present invention, the system which
mainly consists of the heart and blood vessels. The heart delivers
the necessary energy for maintaining blood circulation in the
vascular system. The circulatory system functions as the
organism's transportation system, in which the blood transports O2,
nutritious matter, hormones, and other substances of importance
for cell regulation into the tissue. Further, the blood removes
CO2 from the tissue to the lungs and residual substances to e.g.
the kidneys. Furthermore, the blood is of importance for the
temperature regulation and the defence mechanisms of the body,
which include the immune system.

A number of in vivo animal models exist for assessment of the
immunogenic potential of polypeptides. Some of these models give a
suitable basis for hazard assessment in man. Suitable models
include a mouse model.

This model seeks to identify the immunogenic response in the
form of the IgG response in Balb/C mice injected subcutaneously
with modified and unmodified polypeptides.
Other animal models can also be used for assessment of the
immunogenic potential.

A polypeptide having "reduced immunogenicity" according to the
invention means that the amount of produced antibodies, e.g.
immunoglobulin in humans, and molecules with comparable effects in
specific animals, which can lead to an immune response, is
significantly decreased, when the polypeptide is introduced into
the circulatory system, in comparison to the corresponding parent
polypeptide.

For Balb/C mice the IgG response gives a good indication of the
immunogenic potential of polypeptides.

Assessment of allergenicity

Assessment of allergenicity may be made by inhalation tests,
comparing the effect of intratracheally (into the trachea)
administered parent enzymes with that of the corresponding
modified enzymes according to the invention.

A number of in vivo animal models exist for assessment of the
allergenicity of enzymes. Some of these models give a suitable
basis for hazard assessment in man. Suitable models include a
guinea pig model and a mouse model. These models seek to identify
respiratory allergens as a function of elicitation reactions
induced in previously sensitised animals. According to these
models the alleged allergens are introduced intratracheally into
the animals.

A suitable strain of guinea pigs, the Dunkin Hartley strain,
does not, as humans do, produce IgE antibodies in connection with
the allergic response. However, they produce another type of
antibody, IgG1A and IgG1B (see e.g. Prento, ATLA, 19, p. 8-14,
1991), which are responsible for their allergenic response to
inhaled polypeptides including enzymes. Therefore, when using the
Dunkin Hartley animal model, the relative amount of IgG1A and
IgG1B is a measure of the allergenicity level.

The Balb/C mice strain is suitable for intratracheal exposure. 
Balb/C mice produce IgE as the allergic response. 

More details on assessing respiratory allergens in guinea pigs
and mice are described by Kimber et al. (1996), Fundamental and
Applied Toxicology, 33, p. 1-10.
Other animals such as rats, rabbits etc. may also be used for
comparable studies.

Composition 

The invention relates to a composition comprising a
polypeptide-polymer conjugate of the invention.

The composition may be a pharmaceutical or industrial 
composition. 

The composition may further comprise other polypeptides,
proteins or enzymes and/or ingredients normally used in e.g.
detergents, including soap bars, household articles,
agrochemicals, personal care products, including skin care
compositions, cleaning compositions for e.g. contact lenses, oral
and dermal pharmaceuticals, compositions used for treating
textiles, compositions used for manufacturing food, e.g. baking,
and feed etc.

Use of the polypeptide-polymer conjugate 

The invention also relates to the use of the method of the
invention for reducing the immune response of polypeptides.
It is also an object of the invention to use the polypeptide-
polymer conjugate of the invention to reduce the allergenicity of
industrial products, such as detergents, e.g. laundry, dish
wash and hard surface cleaning detergents, and food or feed
products.


MATERIAL AND METHODS 
Materials 

Enzymes : 

PD498: Protease of subtilisin type shown in WO 93/24623. The
sequence of PD498 is shown in SEQ ID NO. 1 and 2.
Savinase® (available from Novo Nordisk A/S)

Humicola lanuginosa lipase: Available from Novo Nordisk as
Lipolase® and further described in EP 305,216. The DNA and
protein sequences are shown in SEQ ID NO 5 and 6, respectively.

Strains:

B. subtilis 309 and 147 are variants of Bacillus lentus,
deposited with the NCIB and accorded the accession numbers NCIB
10309 and 10147, and described in US Patent No. 3,723,250,
incorporated by reference herein.

E. coli MC 1000 (M.J. Casadaban and S.N. Cohen (1980); J.
Mol. Biol. 138 179-207), was made r-,m+ by conventional methods
and is also described in US Patent Application Serial No.
039,298.

Vectors : 

pPD498: E. coli - B. subtilis shuttle vector (described in
US patent No. 5,621,089 under section 6.2.1.6) containing the
wild-type gene encoding the PD498 protease (SEQ ID NO. 2). The
same vector is used for mutagenesis in E. coli as well as for
expression in B. subtilis.

General molecular biology methods:

Unless otherwise mentioned the DNA manipulations and
transformations were performed using standard methods of
molecular biology (Sambrook et al. (1989) Molecular cloning: A
laboratory manual, Cold Spring Harbor lab., Cold Spring Harbor,
NY; Ausubel, F. M. et al. (eds.) "Current protocols in
Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R.,
and Cutting, S. M. (eds.) "Molecular Biological Methods for
Bacillus". John Wiley and Sons, 1990).

Enzymes for DNA manipulations were used according to the
specifications of the suppliers.


Materials, chemicals and solutions: 

Horse Radish Peroxidase labeled anti-rat-Ig (Dako, DK, P162, #
031; dilution 1:1000).
Mouse anti-rat IgE (Serotec MCA193; dilution 1:200).
Rat anti-mouse IgE (Serotec MCA419; dilution 1:100).
Biotin-labeled mouse anti-rat IgG1 monoclonal antibody (Zymed 03-
9140; dilution 1:1000)
Biotin-labeled rat anti-mouse IgG1 monoclonal antibody (Serotec
MCA336B; dilution 1:1000)
Streptavidin-horse radish peroxidase (Kirkegard & Perry 14-30-00;
dilution 1:1000).
CovaLink NH2 plates (Nunc, Cat# 459439)
Cyanuric chloride (Aldrich)
Acetone (Merck)
Rat anti-Mouse IgG1, biotin (SeroTec, Cat# MCA336B)
Streptavidin, peroxidase (KPL)
Ortho-Phenylene-diamine (OPD) (Kem-en-Tec)
H2O2, 30% (Merck)
Tween 20 (Merck)
Skim Milk powder (Difco)
H2SO4 (Merck)


Buffers and Solutions: 

Carbonate buffer (0.1 M, pH 10 (1 liter))      Na2CO3       10.60 g

PBS (pH 7.2 (1 liter))                         NaCl          8.00 g
                                               KCl           0.20 g
                                               K2HPO4        1.04 g
                                               KH2PO4        0.32 g

Washing buffer     PBS, 0.05% (v/v) Tween 20
Blocking buffer    PBS, 2% (wt/v) Skim Milk powder
Dilution buffer    PBS, 0.05% (v/v) Tween 20, 0.5% (wt/v) Skim Milk
                   powder

Citrate buffer (0.1 M, pH 5.0-5.2 (1 liter))   Na citrate   20.60 g
                                               Citric acid   6.30 g

Activation of CovaLink plates: 

• Make a fresh stock solution of 10 mg cyanuric chloride per ml
acetone.

• Just before use, dilute the cyanuric chloride stock solution
into PBS, while stirring, to a final concentration of 1 mg/ml.

• Add 100 µl of the dilution to each well of the CovaLink NH2
plates, and incubate for 5 minutes at room temperature.

• Wash 3 times with PBS.

• Dry the freshly prepared activated plates at 50°C for 30
minutes.

• Immediately seal each plate with sealing tape.

• Preactivated plates can be stored at room temperature for 3
weeks when kept in a plastic bag.
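
The dilution arithmetic of the activation steps above can be
sketched as follows. This is only an illustrative calculation
(C1 x V1 = C2 x V2); the function name, the 96-well plate size,
the 10% excess and the reading of the per-well volume as 100
microlitres are assumptions and not part of the protocol.

```python
def cyanuric_chloride_dilution(wells=96, well_volume_ul=100.0,
                               stock_mg_per_ml=10.0, final_mg_per_ml=1.0,
                               excess=0.10):
    """Return (stock_ml, pbs_ml) needed to fill one plate.

    Assumed values: 96 wells, 100 microlitres per well, 10% excess.
    Stock and final concentrations are those stated in the protocol.
    """
    # Total volume of the 1 mg/ml working dilution, including the excess.
    total_ml = wells * well_volume_ul / 1000.0 * (1.0 + excess)
    # C1 * V1 = C2 * V2  ->  V1 = V2 * C2 / C1
    stock_ml = total_ml * final_mg_per_ml / stock_mg_per_ml
    pbs_ml = total_ml - stock_ml
    return stock_ml, pbs_ml


stock_ml, pbs_ml = cyanuric_chloride_dilution()
print(f"stock: {stock_ml:.3f} ml, PBS: {pbs_ml:.3f} ml")
```

Under these assumptions about one part stock is stirred into nine
parts PBS immediately before the plate is filled.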

Sodium Borate, borax (Sigma)
3,3-Dimethyl glutaric acid (Sigma)
CaCl2 (Sigma)
Tresyl chloride (2,2,2-trifluoroethanesulfonyl chloride) (Fluka)
1-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC) (Fluka)
N-Hydroxy succinimide (Fluka art. 56480)
Phosgene (Fluka art. 79380)
Lactose (Merck 7656)
PMSF (phenylmethylsulfonyl fluoride) from Sigma
Succinyl-Alanine-Alanine-Proline-Phenylalanine-para-nitroanilide
(Suc-AAPF-pNA) Sigma no. S-7388, Mw 624.6 g/mole.


Colouring substrate:

OPD: o-phenylene-diamine (Kementec cat no. 4260)

Test Animals:

Dunkin Hartley guinea pigs (from Charles River, DE)
Female Balb/C mice (about 20 grams) purchased from Bomholdtgaard,
Ry, Denmark.

Equipment : 
XCEL II (Novex)
ELISA reader (UVmax, Molecular Devices)
HPLC (Waters)
FPLC (Pharmacia)
Superdex-75 column, Mono-Q, Mono S from Pharmacia, SW.
SLT: Photometer from SLT LabInstruments
Size-exclusion chromatograph (Spherogel TSK-G2000 SW).
Size-exclusion chromatograph (Superdex 200, Pharmacia, SW)
Amicon Cell



Enzymes for DNA manipulations

Unless otherwise mentioned all enzymes for DNA
manipulations, such as e.g. restriction endonucleases, ligases
etc., are obtained from New England Biolabs, Inc.

Methods

ELISA procedure for determination of IgG1 positive guinea pigs

ELISA microtiter plates are coated with rabbit anti-PD498
1:8000 in carbonate buffer and incubated overnight at 4°C. The
next day the plates are blocked with 2% BSA for 1 hour and washed
3 times with PBS Tween 20.

1 µg/ml PD498 is added to the plates and incubated for 1 hour,
then washed 3 times with PBS Tween 20.

All guinea pig sera samples and controls are applied to the
ELISA plates with 2 µl sera and 98 µl PBS, incubated for 1 hour
and washed 3 times with PBS Tween 20.

Then goat anti-guinea pig IgG1 (1:4000 in PBS buffer (Nordic
Immunology 44-682)) is applied to the plates, incubated for 1 hour
and washed with PBS Tween 20.

Alkaline phosphatase marked rabbit anti-goat 1:8000 (Sigma
A4187) is applied and incubated for 1 hour, washed 2 times in PBS
Tween 20 and 1 time with diethanolamine buffer.

The marked alkaline phosphatase is developed using p-
nitrophenyl phosphate for 30 minutes at 37°C or until appropriate
colour has developed.

The reaction is stopped using Stop medium (K2HPO4/NaN3 buffer
comprising EDTA (pH 10)) and read at OD 405/650 using an ELISA
reader.

Double blinds are included on all ELISA plates.

The cut-off between positive and negative sera is calculated as
the average blind value plus 2 times the standard deviation. This
gives an accuracy of 95%.
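
The cut-off rule described above (average blind value plus 2
times the standard deviation) can be sketched as follows. This is
a minimal illustration only; the function names, the use of the
sample standard deviation and the blank readings are assumptions
and are not taken from the procedure.

```python
from statistics import mean, stdev


def elisa_cutoff(blank_ods, n_sd=2.0):
    """Cut-off OD: mean of the blanks plus n_sd standard deviations.

    The sample standard deviation is used here (an assumption).
    """
    return mean(blank_ods) + n_sd * stdev(blank_ods)


def classify(sample_od, cutoff):
    """Call a serum positive when its OD exceeds the cut-off."""
    return "positive" if sample_od > cutoff else "negative"


blanks = [0.10, 0.12, 0.11, 0.09]   # hypothetical blank readings
cutoff = elisa_cutoff(blanks)
print(round(cutoff, 4), classify(0.30, cutoff), classify(0.12, cutoff))
```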

Determination of the molecular weight

Electrophoretic separation of proteins was performed by standard
methods using 4-20% gradient SDS polyacrylamide gels (Novex).
Proteins were detected by silver staining. The molecular weight
was measured relative to the mobility of Mark-12® wide range
molecular weight standards from Novex.



Protease activity
Analysis with Suc-Ala-Ala-Pro-Phe-pNA:

Proteases cleave the bond between the peptide and p-
nitroaniline to give a visible yellow colour absorbing at 405 nm.

Buffer: e.g. Britton and Robinson buffer pH 8.3
Substrate: 100 mg Suc-AAPF-pNA is dissolved into 1 ml dimethyl
sulfoxide (DMSO). 100 µl of this is diluted into 10 ml with
Britton and Robinson buffer.

The substrate and protease solution are mixed and the
absorbance is monitored at 405 nm as a function of time. The
rate of change, ABS405 nm/min, is a measure of the protease
activity in the sample. The temperature should be controlled
(20-50°C depending on the protease).
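
The rate ABS405 nm/min can be obtained from the recorded time
course as the slope of absorbance against time. The sketch below
uses an ordinary least-squares fit; the function name and the
readings are hypothetical, and the choice of a linear fit over
the whole time course is an assumption, not part of the assay
description.

```python
def abs_per_min(times_min, absorbances):
    """Least-squares slope of absorbance versus time (ABS405 per minute)."""
    n = len(times_min)
    if n < 2 or n != len(absorbances):
        raise ValueError("need at least two paired readings")
    t_mean = sum(times_min) / n
    a_mean = sum(absorbances) / n
    sxx = sum((t - t_mean) ** 2 for t in times_min)
    sxy = sum((t - t_mean) * (a - a_mean)
              for t, a in zip(times_min, absorbances))
    return sxy / sxx


times = [0.0, 1.0, 2.0, 3.0]              # minutes (hypothetical)
readings = [0.050, 0.110, 0.170, 0.230]   # ABS at 405 nm (hypothetical)
rate = abs_per_min(times, readings)
print(f"{rate:.3f} ABS405/min")
```

Only the initial, linear part of the curve should be passed to
such a fit, since the slope flattens as substrate is consumed.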

Proteolytic Activity

In the context of this invention proteolytic activity is
expressed in Kilo NOVO Protease Units (KNPU). The activity is
determined relative to an enzyme standard (SAVINASE®), and
the determination is based on the digestion of a dimethyl
casein (DMC) solution by the proteolytic enzyme at standard
conditions, i.e. 50°C, pH 8.3, 9 min. reaction time, 3 min.
measuring time. A folder AF 220/1 is available upon request to
Novo Nordisk A/S, Denmark, which folder is hereby included by
reference.

A GU is a Glycine Unit, defined as the proteolytic enzyme
activity which, under standard conditions, during a 15-minute
incubation at 40°C, with N-acetyl casein as substrate, produces
an amount of NH2-group equivalent to 1 mmole of glycine.

Enzyme activity can also be measured using the PNA assay,
according to reaction with the soluble substrate succinyl-
alanine-alanine-proline-phenylalanine-para-nitrophenol, which
is described in the Journal of American Oil Chemists Society,
Rothgeb, T.M., Goodlander, B.D., Garrison, P.H., and Smith,
L.A., (1988).

Fermentation of PD498 variants

Fermentation of PD498 variants in B. subtilis is performed
at 30°C on a rotary shaking table (300 r.p.m.) in 500 ml baffled
Erlenmeyer flasks containing 100 ml BPX medium for 5 days. In
order to make e.g. a 2 liter broth, 20 Erlenmeyer flasks are
fermented simultaneously.

Media:

BPX: Composition (per liter)

Potato starch            100 g
Ground barley             50 g
Soybean flour             20 g
Na2HPO4 x 12 H2O           9 g
Pluronic                 0.1 g
Sodium caseinate          10 g

The starch in the medium is liquefied with α-amylase and
the medium is sterilized by heating at 120°C for 45 minutes.
After sterilization the pH of the medium is adjusted to 9 by
addition of NaHCO3 to 0.1 M.

Purification of PD498 variants

Approximately 1.6 litres of PD498 variant fermentation
broth are centrifuged at 5000 rpm for 35 minutes in 1 litre
beakers. The supernatants are adjusted to pH 7.0 using 10%
acetic acid and filtered on Seitz Supra S100 filter plates.
The filtrates are concentrated to approximately 400 ml using an
Amicon CH2A UF unit equipped with an Amicon S1Y10 UF cartridge.
The UF concentrate is centrifuged and filtered prior to
absorption at room temperature on a Bacitracin affinity column
at pH 7. The PD498 variant is eluted from the Bacitracin column
at room temperature using 25% 2-propanol and 1 M sodium
chloride in a buffer solution with 0.01 M dimethylglutaric
acid, 0.1 M boric acid and 0.002 M calcium chloride adjusted to
pH 7.

The fractions with protease activity from the Bacitracin
purification step are combined and applied to a 750 ml Sephadex
G25 column (5 cm diameter) equilibrated with a buffer
containing 0.01 M dimethylglutaric acid, 0.1 M boric acid and
0.002 M calcium chloride adjusted to pH 6.0.

Fractions with proteolytic activity from the Sephadex G25
column are combined and applied to a 150 ml CM Sepharose CL 6B
cation exchange column (5 cm diameter) equilibrated with a
buffer containing 0.01 M dimethylglutaric acid, 0.1 M boric
acid, and 0.002 M calcium chloride adjusted to pH 6.0.
The protease is eluted using a linear gradient of 0-0.5 M
sodium chloride in 1 litre of the same buffer.
Protease containing fractions from the CM Sepharose column are
combined and filtered through a 2 µ filter.






Balb/C mice IgG ELISA Procedure:

The antigen is diluted to 1 mg/ml in carbonate buffer.
100 µl is added to each well.
The plates are coated overnight at 4°C.
Unspecific adsorption is blocked by incubating each well for 1
hour at room temperature with 200 µl blocking buffer.
The plates are washed 3x with 300 µl washing buffer.
Unknown mouse sera are diluted in dilution buffer, typically
10x, 20x and 40x, or higher.
100 µl is added to each well.
Incubation is for 1 hour at room temperature.
Unbound material is removed by washing 3x with washing buffer.
The anti-Mouse IgG1 antibody is diluted 2000x in dilution
buffer.
100 µl is added to each well.
Incubation is for 1 hour at room temperature.
Unbound material is removed by washing 3x with washing buffer.
Streptavidin is diluted 1000x in dilution buffer.
100 µl is added to each well.
Incubation is for 1 hour at room temperature.
Unbound material is removed by washing 3x with 300 µl washing
buffer.
OPD (0.6 mg/ml) and H2O2 (0.4 µl/ml) are dissolved in citrate
buffer.
100 µl is added to each well.
Incubation is for 10 minutes at room temperature.
The reaction is stopped by adding 100 µl H2SO4.
The plates are read at 492 nm with 620 nm as reference.



Immunisation of mice

Balb/C mice (20 grams) are immunised 10 times (intervals of 14
days) by subcutaneous injection of the modified or unmodified
polypeptide in question, respectively, by standard procedures
known in the art.

EXAMPLES
Example 1

Suitable substitutions in PD498 for addition of amino
attachment groups (-NH2)

The 3D structure of parent PD498 was modeled as described
above based on 59% sequence identity with Thermitase®
(2tec.pdb).

The sequence of PD498 is shown in SEQ ID NO. 2. PD498 residue
numbering is used, 1-280.

The commands performed in Insight (BIOSYM) are shown in the
command files makeKzone.bcl and makeKzone2.bcl below:

Conservative substitutions:

makeKzone.bcl

1 Delete Subset *
2 Color Molecule Atoms * Specified Specification 255,0,255
3 Zone Subset LYS :lys:NZ Static monomer/residue 10 Color_Subset 255,255,0
4 Zone Subset NTERM :1:N Static monomer/residue 10 Color_Subset 255,255,0
5 #NOTE: editnextline ACTSITE residues according to the protein
6 Zone Subset ACTSITE :39,72,226 Static monomer/residue 8 Color_Subset 255,255,0
7 Combine Subset ALLZONE Union LYS NTERM
8 Combine Subset ALLZONE Union ALLZONE ACTSITE
9 #NOTE: editnextline object name according to the protein
10 Combine Subset REST Difference PD498FINALMODEL ALLZONE
11 List Subset REST Atom Output_File restatom.list
12 List Subset REST monomer/residue Output_File restmole.list
13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0
14 List Subset ACTSITE Atom Output_File actsiteatom.list
15 List Subset ACTSITE monomer/residue Output_File actsitemole.list
16 #
17 Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset
18 Combine Subset SUB5A Difference REST5A ACTSITE
19 Combine Subset SUB5B Difference SUB5A REST
20 Color Molecule Atoms SUB5B Specified Specification 255,255,255
21 List Subset SUB5B Atom Output_File sub5batom.list
22 List Subset SUB5B monomer/residue Output_File sub5bmole.list
23 #Now identify sites for arg->lys substitutions and continue with makezone2.bcl
24 #Use grep command to identify ARG in restatom.list, sub5batom.list & actsiteatom.list


Comments:

Lines 1-8: The subset ALLZONE is defined as those residues
which are either within 10 Å of the free amino groups on
lysines or the N-terminal, or within 8 Å of the catalytic triad
residues 39, 72 and 226.

Line 10: The subset REST is defined as those residues not
included in ALLZONE.

Lines 17-20: Subset SUB5B is defined as those residues in a
5 Å shell around REST, excluding residues within 8 Å of the
catalytic residues.

Lines 23-24: REST contains Arg62 and Arg169, SUB5B contains
Arg51, Arg121, and Arg250. ACTSITE contains Arg103, but
position 103 is within 8 Å of essential catalytic residues,
and thus not relevant.

The colour codes are: (255,0,255) = magenta, (255,255,0) =
yellow, (255,0,0) = red, and (255,255,255) = white.
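
The subset logic of makeKzone.bcl (zones, union, difference) can
be mirrored outside Insight. The following is a minimal sketch
only, not the Insight implementation: the atom tuples, the
function names and the toy coordinates are hypothetical, and it
assumes that a "monomer/residue" zone selects every residue
having at least one atom within the given radius.

```python
import math


def zone(atoms, centre_atoms, radius):
    """Residue numbers with at least one atom within `radius` (in Å)
    of any centre atom. Atoms are (resnum, name, x, y, z) tuples."""
    hits = set()
    for res, _name, x, y, z in atoms:
        for _cres, _cname, cx, cy, cz in centre_atoms:
            if math.dist((x, y, z), (cx, cy, cz)) <= radius:
                hits.add(res)
                break
    return hits


def select_subsets(atoms, lys_nz, nterm_n, catalytic_residues):
    """Return (REST, SUB5B, ACTSITE) as sets of residue numbers."""
    all_res = {a[0] for a in atoms}
    lys = zone(atoms, lys_nz, 10.0)                 # script line 3
    nterm = zone(atoms, nterm_n, 10.0)              # script line 4
    cat_atoms = [a for a in atoms if a[0] in catalytic_residues]
    actsite = zone(atoms, cat_atoms, 8.0)           # script line 6
    allzone = lys | nterm | actsite                 # script lines 7-8
    rest = all_res - allzone                        # script line 10
    rest_atoms = [a for a in atoms if a[0] in rest]
    rest5a = zone(atoms, rest_atoms, 5.0)           # script line 17
    sub5b = (rest5a - actsite) - rest               # script lines 18-19
    return rest, sub5b, actsite


# Toy model: one pseudo-atom per residue, spaced 4 Å apart along x.
atoms = [(i, "CA", 4.0 * (i - 1), 0.0, 0.0) for i in range(1, 13)]
lys_nz = [atoms[2]]     # pretend residue 3 is the only lysine
nterm_n = [atoms[0]]    # residue 1 carries the N-terminal amino group
rest, sub5b, actsite = select_subsets(atoms, lys_nz, nterm_n, {12})
print(sorted(rest), sorted(sub5b), sorted(actsite))
```

With real coordinates the arginines falling in REST or SUB5B
would then be the candidates for conservative substitution.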

The substitutions R51K, R62K, R121K, R169K and R250K are
identified in parent PD498 as suitable sites for mutagenesis.
The residues are substituted in section 2 below, and further
analysis is done:



Non-conservative substitutions:

makeKzone2.bcl

1 #sourcefile makezone2.bcl Claus von der Osten 961128
2 #
3 #having scanned lists (grep arg command) and identified sites for arg->lys substitutions
4 #NOTE: editnextline object name according to protein
5 Copy Object -To_Clipboard -Displace PD498FINALMODEL newmodel
6 Biopolymer
7 #NOTE: editnextline object name according to protein
8 Blank Object On PD498FINALMODEL
9 #NOTE: editnextlines with arg->lys positions
10 Replace Residue newmodel:51 lys L
11 Replace Residue newmodel:62 lys L
12 Replace Residue newmodel:121 lys L
13 Replace Residue newmodel:169 lys L
14 Replace Residue newmodel:250 lys L
15 #
16 #Now repeat analysis done prior to arg->lys, now including introduced lysines
17 Color Molecule Atoms newmodel Specified Specification 255,0,255
18 Zone Subset LYSx newmodel:lys:NZ Static monomer/residue 10 Color_Subset 255,255,0
19 Zone Subset NTERMx newmodel:1:N Static monomer/residue 10 Color_Subset 255,255,0
20 #NOTE: editnextline ACTSITEx residues according to the protein
21 Zone Subset ACTSITEx newmodel:39,72,226 Static monomer/residue 8 Color_Subset 255,255,0
22 Combine Subset ALLZONEx Union LYSx NTERMx
23 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx
24 Combine Subset RESTx Difference newmodel ALLZONEx
25 List Subset RESTx Atom Output_File restxatom.list
26 List Subset RESTx monomer/residue Output_File restxmole.list
27 #
28 Color Molecule Atoms ACTSITEx Specified Specification 255,0,0
29 List Subset ACTSITEx Atom Output_File actsitexatom.list
30 List Subset ACTSITEx monomer/residue Output_File actsitexmole.list
31 #
32 #read restxatom.list or restxmole.list to identify sites for (not_arg)->lys subst. if needed

Comments:

Lines 1-15: Solvent exposed arginines in subsets REST and
SUB5B are replaced by lysines. Solvent accessibilities are
recalculated following arginine replacement.

Lines 16-23: The subset ALLZONEx is defined as those
residues which are either within 10 Å of the free amino groups
on lysines (after replacement) or the N-terminal, or within 8 Å
of the catalytic triad residues 39, 72 and 226.

Lines 24-26: The subset RESTx is defined as those residues
not included in ALLZONEx, i.e. residues which are still
potential epitope contributors. Of the residues in RESTx, the
following are >5% exposed (see lists below): 6-7, 9-12, 43-45,
65, 87-88, 209, 211, 216-221, 262.

The following mutations are proposed in parent PD498: P6K,
Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, G87K, I88K,
N209K, A211K, N216K, N217K, G218K, Y219K, S220K, Y221K, G262K.
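
The substitution names above follow the pattern "original residue,
position, K". A minimal sketch of how such names can be generated
from a residue list is given below; the function name and the
lookup table are illustrative helpers and are not part of the
Insight command files.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "ALA": "A", "ARG": "R", "ASN": "N", "ASP": "D", "CYS": "C",
    "GLN": "Q", "GLU": "E", "GLY": "G", "HIS": "H", "ILE": "I",
    "LEU": "L", "LYS": "K", "MET": "M", "PHE": "F", "PRO": "P",
    "SER": "S", "THR": "T", "TRP": "W", "TYR": "Y", "VAL": "V",
}


def lysine_substitutions(residues):
    """Name X->Lys substitutions, e.g. ("PRO", 6) -> "P6K".

    Residues that are already lysine are skipped.
    """
    names = []
    for resname, number in residues:
        one = THREE_TO_ONE[resname.upper()]
        if one != "K":
            names.append(f"{one}{number}K")
    return names


# First exposed RESTx residues listed in the text above.
restx_exposed = [("PRO", 6), ("TYR", 7), ("SER", 9),
                 ("ALA", 10), ("TYR", 11), ("GLN", 12)]
proposed = lysine_substitutions(restx_exposed)
print(", ".join(proposed))
```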
Relevant data for Example 1:

Solvent accessibility data for PD498MODEL: 

# PD498MODEL Fri Nov 29 10:24:48 MET 1996 

# residue area
TRP_1       136.275711
SER_2       88.188095
PRO_3       15.458788
ASN_4       95.322319
ASP_5       4.903404
PRO_6       68.096909
TYR_7       93.333252
TYR_8       31.791576
SER_9       95.983139
ALA_10      77.983536
TYR_11      150.704727
GLN_12      26.983349
TYR_13      44.328232
GLY_14      3.200084
PRO_15      2.149547
GLN_16      61.385445
ASN_17      37.776707
THR_18      1.237873
SER_19      41.031750
THR_20      4.321402
PRO_21      16.658991
ALA_22      42.107288
ALA_23      0.000000
TRP_24      3.713619
ASP_25      82.645493
VAL_26      74.397812
THR_27      14.950654
ARG_28      110.606209
GLY_29      0.242063
SER_30      57.225292
SER_31      86.986198
THR_32      1.928865
GLN_33      42.008949
THR_34      0.502189
VAL_35      0.268693
ALA_36      0.000000
VAL_37      5.255383
LEU_38      1.550332
ASP_39      3.585718
SER_40      2.475746
GLY_41      4.329043
VAL_42      1.704864
ASP_43      25.889742
TYR_44      89.194855
ASN_45      109.981819
HIS_46      0.268693
PRO_47      66.580925
ASP_48      0.000000
LEU_49      0.770882
ALA_50      49.618046
ARG_51      218.751709
LYS_52      18.808538
VAL_53      39.937984
ILE_54      98.478104
LYS_55      103.612228
GLY_56      17.199390
TYR_57      67.719147
ASP_58      0.000000
PHE_59      40.291119
ILE_60      50.151962
ASP_61      70.078888
ARG_62      166.777557
ASP_63      35.892376
ASN_64      120.641953
ASN_65      64.982895
PRO_66      6.986028
MET_67      58.504269
ASP_68      28.668840
LEU_69      104.467468
ASN_70      78.460953
GLY_71      5.615932
HIS_72      43.158905
GLY_73      0.268693
THR_74      0.000000
HIS_75      0.484127
VAL_76      1.880854
ALA_77      0.000000
GLY_78      0.933982
THR_79      9.589676
VAL_80      0.000000
ALA_81      0.000000
ALA_82      0.000000
ASP_83      46.244987
THR_84      27.783333
ASN_85      75.924225
ASN_86      44.813908
GLY_87      50.453152
ILE_88      74.428070
GLY_89      4.115077
VAL_90      6.717335
ALA_91      2.872341
GLY_92      0.233495
MET_93      5.876057
ALA_94      0.000000
PRO_95      17.682203
ASP_96      83.431740
THR_97      1.506567
LYS_98      72.674973
ILE_99      4.251006
LEU_100     6.717335
ALA_101     0.806080
VAL_102     1.426676
ARG_103     2.662697
VAL_104     2.171855
LEU_105     18.808538
ASP_106     52.167435
ALA_107     52.905663
ASN_108     115.871315
GLY_109     30.943356
SER_110     57.933651
GLY_111     50.705326
SER_112     56.383320
LEU_113     71.312195
ASP_114     110.410919
SER_115     13.910152
ILE_116     22.570246
ALA_117     5.642561
SER_118     29.313131
GLY_119     0.000000
ILE_120     1.343467
ARG_121     118.391129
TYR_122     44.203033
ALA_123     0.000000
ALA_124     7.974043
ASP_125     83.851639
GLN_126     64.311974
GLY_127     36.812618
ALA_128     4.705107
LYS_129     90.886139
VAL_130     1.039576
LEU_131     2.149547
ASN_132     4.315227
LEU_133     1.880854
SER_134     3.563334
LEU_135     26.371397
GLY_136     59.151070
CYS_137     63.333755
GLU_138     111.553314
CYS_139     83.591461
ASN_140     80.757843
SER_141     25.899158
THR_142     99.889725
THR_143     73.323814
LEU_144     5.589301
LYS_145     94.708755
SER_146     72.636993
ALA_147     9.235920
VAL_148     1.612160
ASP_149     57.431465
TYR_150     106.352493
ALA_151     0.268693
TRP_152     43.133667
ASN_153     112.864975
LYS_154     110.009468
GLY_155     33.352180
ALA_156     3.493014
VAL_157     1.048144
VAL_158     2.043953
VAL_159     0.000000
ALA_160     0.537387
ALA_161     10.872165
ALA_162     7.823834
GLY_163     12.064573
ASN_164     81.183388
ASP_165     64.495300
ASN_166     83.457443
VAL_167     68.516815
SER_168     78.799652
ARG_169     116.937134
THR_170     57.275074
PHE_171     51.416462
GLN_172     18.934589
PRO_173     1.880854
ALA_174     6.522357
SER_175     26.184139
TYR_176     21.425076
PRO_177     85.613541
ASN_178     34.700817
ALA_179     0.268693
ILE_180     1.074774
ALA_181     3.761708
VAL_182     0.000000
GLY_183     2.149547
ALA_184     0.951118
ILE_185     0.806080
ASP_186     30.022263
SER_187     72.518509
ASN_188     [illegible]
ASP_189     47.601345
ARG_190     150.050873
LYS_191     64.822807
ALA_192     2.686934
SER_193     96.223808
PHE_194     51.482613
SER_195     1.400973
ASN_196     4.148808
TYR_197     80.937309
GLY_198     10.747736
THR_199     93.221252
TRP_200     169.943604
VAL_201     15.280325
ASP_202     12.141763
VAL_203     0.268693
THR_204     3.409728
ALA_205     0.000000
PRO_206     0.000000
GLY_207     0.000000
VAL_208     37.137192
ASN_209     78.286270
ILE_210     9.404268
ALA_211     25.938599
SER_212     5.037172
THR_213     0.000000
VAL_214     22.301552
PRO_215     45.251030
ASN_216     131.014160
ASN_217     88.383461
GLY_218     21.226780
TYR_219     88.907570
SER_220     39.966541
TYR_221     166.037018
MET_222     50.951096
SER_223     54.435001
GLY_224     1.880854
THR_225     1.634468
SER_226     17.432346
MET_227     7.233279
ALA_228     0.000000
SER_229     0.000000
PRO_230     0.268693
HIS_231     2.680759
VAL_232     0.000000
ALA_233     0.000000
GLY_234     1.074774
LEU_235     11.500556
ALA_236     0.000000
ALA_237     0.000000
LEU_238     1.612160
LEU_239     0.000000
ALA_240     10.648088
SER_241     39.138004
GLN_242     71.056175
GLY_243     66.487144
LYS_244     43.256012
ASN_245     80.728127
ASN_246     34.859673
VAL_247     84.145645
GLN_248     51.819775
ILE_249     8.598188
ARG_250     35.055809
GLN_251     71.928093
ALA_252     0.000000
ILE_253     4.845899
GLU_254     13.344438
GLN_255     81.705254
THR_256     9.836061
ALA_257     2.810513
ASP_258     44.656136
LYS_259     113.071686
ILE_260     32.089527
SER_261     91.590103
GLY_262     26.450439
THR_263     38.308762
GLY_264     46.870056
THR_265     88.551804
ASN_266     34.698349
PHE_267     7.756911
LYS_268     103.212852
TYR_269     37.638382
GLY_270     0.000000
LYS_271     11.376978
ILE_272     2.885231
ASN_273     19.195255
SER_274     2.651736
ASN_275     38.177547
LYS_276     84.549576
ALA_277     1.074774
VAL_278     4.775503
ARG_279     162.693054
TYR_280     96.572929
CA_281      0.000000
CA_282      0.000000
CA_283      8.803203

restmole.list
Subset REST:
PD498FINALMODEL:6-7,9-12,43-46,61-63,65,87-89,111-114,117-118,131,
PD498FINALMODEL:137-139,158-159,169-171,173-174,180-181,209,211,
PD498FINALMODEL:216-221,232-233,262,E282H

restatom.list
Subset REST:
PD498FINALMODEL:PRO 6:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:TYR 7:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:SER 9:N,CA,C,O,CB,OG
PD498FINALMODEL:ALA 10:N,CA,C,O,CB
PD498FINALMODEL:TYR 11:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:GLN 12:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:ASP 43:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:TYR 44:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:ASN 45:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:HIS 46:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
PD498FINALMODEL:ASP 61:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:ARG 62:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:ASP 63:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:ASN 65:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:GLY 87:N,CA,C,O
PD498FINALMODEL:ILE 88:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:GLY 89:N,CA,C,O
PD498FINALMODEL:GLY 111:N,CA,C,O
PD498FINALMODEL:SER 112:N,CA,C,O,CB,OG
PD498FINALMODEL:LEU 113:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ASP 114:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:ALA 117:N,CA,C,O,CB
PD498FINALMODEL:SER 118:N,CA,C,O,CB,OG
PD498FINALMODEL:LEU 131:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:CYS 137:N,CA,C,O,CB,SG
PD498FINALMODEL:GLU 138:N,CA,C,O,CB,CG,CD,OE1,OE2
PD498FINALMODEL:CYS 139:N,CA,C,O,CB,SG
PD498FINALMODEL:VAL 158:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:VAL 159:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ARG 169:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:THR 170:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:PHE 171:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
PD498FINALMODEL:PRO 173:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:ALA 174:N,CA,C,O,CB
PD498FINALMODEL:ILE 180:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:ALA 181:N,CA,C,O,CB
PD498FINALMODEL:ASN 209:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:ALA 211:N,CA,C,O,CB
PD498FINALMODEL:ASN 216:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:ASN 217:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:GLY 218:N,CA,C,O
PD498FINALMODEL:TYR 219:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:SER 220:N,CA,C,O,CB,OG
PD498 FINALMODEL : TYR 
5 221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

PD4 9 8 FINALMODEL : VAL 2 3 2 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL: ALA 233 :N, OA, C, O, CB 
PD4 9 8 FINALMODEL : GLY 262 :N, OA, 0,0 
PD4 9 8 FINALMODEL: OA E282H:CA 

Subset SUB5B:

sub5bmole.list
Subset SUB5B:
PD498FINALMODEL:4-5,8,13-16,34-35,47-51,53,64,83,85-86,90-91,120-124,
PD498FINALMODEL:128-130,140-141,143-144,147-148,151-152,156-157,
PD498FINALMODEL:165,167-168,172,175-176,178-179,196,200-205,208,
PD498FINALMODEL:234-237,250,253-254,260-261,263-267,272,E281H,
PD498FINALMODEL:E283H

sub5batom.list
Subset SUB5B:
PD498FINALMODEL:ASN 4:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:ASP 5:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:TYR 8:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:TYR 13:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:GLY 14:N,CA,C,O
PD498FINALMODEL:PRO 15:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:GLN 16:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:THR 34:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:VAL 35:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:PRO 47:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:ASP 48:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:LEU 49:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ALA 50:N,CA,C,O,CB
PD498FINALMODEL:ARG 51:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:VAL 53:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ASN 64:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:ASP 83:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:ASN 85:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:ASN 86:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:VAL 90:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ALA 91:N,CA,C,O,CB
PD498FINALMODEL:ILE 120:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:ARG 121:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:TYR 122:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:ALA 123:N,CA,C,O,CB
PD498FINALMODEL:ALA 124:N,CA,C,O,CB
PD498FINALMODEL:ALA 128:N,CA,C,O,CB
PD498FINALMODEL:LYS 129:N,CA,C,O,CB,CG,CD,CE,NZ
PD498FINALMODEL:VAL 130:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ASN 140:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:SER 141:N,CA,C,O,CB,OG
PD498FINALMODEL:THR 143:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:LEU 144:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ALA 147:N,CA,C,O,CB
PD498FINALMODEL:VAL 148:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ALA 151:N,CA,C,O,CB
PD498FINALMODEL:TRP 152:N,CA,C,O,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2
PD498FINALMODEL:ALA 156:N,CA,C,O,CB
PD498FINALMODEL:VAL 157:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ASP 165:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:VAL 167:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:SER 168:N,CA,C,O,CB,OG
PD498FINALMODEL:GLN 172:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:SER 175:N,CA,C,O,CB,OG
PD498FINALMODEL:TYR 176:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:ASN 178:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:ALA 179:N,CA,C,O,CB
PD498FINALMODEL:ASN 196:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:TRP 200:N,CA,C,O,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2
PD498FINALMODEL:VAL 201:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ASP 202:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:VAL 203:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:THR 204:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:ALA 205:N,CA,C,O,CB
PD498FINALMODEL:VAL 208:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:GLY 234:N,CA,C,O
PD498FINALMODEL:LEU 235:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ALA 236:N,CA,C,O,CB
PD498FINALMODEL:ALA 237:N,CA,C,O,CB
PD498FINALMODEL:ARG 250:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:ILE 253:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:GLU 254:N,CA,C,O,CB,CG,CD,OE1,OE2
PD498FINALMODEL:ILE 260:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:SER 261:N,CA,C,O,CB,OG
PD498FINALMODEL:THR 263:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:GLY 264:N,CA,C,O
PD498FINALMODEL:THR 265:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:ASN 266:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:PHE 267:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
PD498FINALMODEL:ILE 272:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:CA E281H:CA
PD498FINALMODEL:CA E283H:NA



Subset ACTSITE:

actsitemole.list






Subset ACTSITE:
PD498FINALMODEL:36-42,57-60,66-80,100-110,115-116,119,132-136,160-164,
PD498FINALMODEL:182-184,194,206-207,210,212-215,222-231

actsiteatom.list
Subset ACTSITE:
PD498FINALMODEL:ALA 36:N,CA,C,O,CB
PD498FINALMODEL:VAL 37:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:LEU 38:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ASP 39:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:SER 40:N,CA,C,O,CB,OG
PD498FINALMODEL:GLY 41:N,CA,C,O
PD498FINALMODEL:VAL 42:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:TYR 57:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:ASP 58:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:PHE 59:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
PD498FINALMODEL:ILE 60:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:PRO 66:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:MET 67:N,CA,C,O,CB,CG,SD,CE
PD498FINALMODEL:ASP 68:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:LEU 69:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ASN 70:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:GLY 71:N,CA,C,O
PD498FINALMODEL:HIS 72:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
PD498FINALMODEL:GLY 73:N,CA,C,O
PD498FINALMODEL:THR 74:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:HIS 75:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
PD498FINALMODEL:VAL 76:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ALA 77:N,CA,C,O,CB
PD498FINALMODEL:GLY 78:N,CA,C,O
PD498FINALMODEL:THR 79:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:VAL 80:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:LEU 100:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ALA 101:N,CA,C,O,CB
PD498FINALMODEL:VAL 102:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ARG 103:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:VAL 104:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:LEU 105:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ASP 106:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:ALA 107:N,CA,C,O,CB
PD498FINALMODEL:ASN 108:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:GLY 109:N,CA,C,O
PD498FINALMODEL:SER 110:N,CA,C,O,CB,OG
PD498FINALMODEL:SER 115:N,CA,C,O,CB,OG
PD498FINALMODEL:ILE 116:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:GLY 119:N,CA,C,O
PD498FINALMODEL:ASN 132:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:LEU 133:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:SER 134:N,CA,C,O,CB,OG
PD498FINALMODEL:LEU 135:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:GLY 136:N,CA,C,O
PD498FINALMODEL:ALA 160:N,CA,C,O,CB
PD498FINALMODEL:ALA 161:N,CA,C,O,CB
PD498FINALMODEL:ALA 162:N,CA,C,O,CB
PD498FINALMODEL:GLY 163:N,CA,C,O
PD498FINALMODEL:ASN 164:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:VAL 182:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:GLY 183:N,CA,C,O
PD498FINALMODEL:ALA 184:N,CA,C,O,CB
PD498FINALMODEL:PHE 194:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
PD498FINALMODEL:PRO 206:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:GLY 207:N,CA,C,O
PD498FINALMODEL:ILE 210:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:SER 212:N,CA,C,O,CB,OG
PD498FINALMODEL:THR 213:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:VAL 214:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:PRO 215:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:MET 222:N,CA,C,O,CB,CG,SD,CE
PD498FINALMODEL:SER 223:N,CA,C,O,CB,OG
PD498FINALMODEL:GLY 224:N,CA,C,O
PD498FINALMODEL:THR 225:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:SER 226:N,CA,C,O,CB,OG
PD498FINALMODEL:MET 227:N,CA,C,O,CB,CG,SD,CE
PD498FINALMODEL:ALA 228:N,CA,C,O,CB
PD498FINALMODEL:SER 229:N,CA,C,O,CB,OG
PD498FINALMODEL:PRO 230:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:HIS 231:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2

Subset RESTx:

restxmole.list
Subset RESTx:
NEWMODEL:6-7,9-12,43-46,65,87-89,131,173,209,211,216-221,232-233,
NEWMODEL:262,E282H

restxatom.list
Subset RESTx:
NEWMODEL:PRO 6:N,CA,CD,C,O,CB,CG
NEWMODEL:TYR 7:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
NEWMODEL:SER 9:N,CA,C,O,CB,OG
NEWMODEL:ALA 10:N,CA,C,O,CB
NEWMODEL:TYR 11:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
NEWMODEL:GLN 12:N,CA,C,O,CB,CG,CD,OE1,NE2
NEWMODEL:ASP 43:N,CA,C,O,CB,CG,OD1,OD2
NEWMODEL:TYR 44:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
NEWMODEL:ASN 45:N,CA,C,O,CB,CG,OD1,ND2
NEWMODEL:HIS 46:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
NEWMODEL:ASN 65:N,CA,C,O,CB,CG,OD1,ND2
NEWMODEL:GLY 87:N,CA,C,O
NEWMODEL:ILE 88:N,CA,C,O,CB,CG1,CG2,CD1
NEWMODEL:GLY 89:N,CA,C,O
NEWMODEL:LEU 131:N,CA,C,O,CB,CG,CD1,CD2
NEWMODEL:PRO 173:N,CA,CD,C,O,CB,CG
NEWMODEL:ASN 209:N,CA,C,O,CB,CG,OD1,ND2
NEWMODEL:ALA 211:N,CA,C,O,CB
NEWMODEL:ASN 216:N,CA,C,O,CB,CG,OD1,ND2
NEWMODEL:ASN 217:N,CA,C,O,CB,CG,OD1,ND2
NEWMODEL:GLY 218:N,CA,C,O
NEWMODEL:TYR 219:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
NEWMODEL:SER 220:N,CA,C,O,CB,OG
NEWMODEL:TYR 221:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
NEWMODEL:VAL 232:N,CA,C,O,CB,CG1,CG2
NEWMODEL:ALA 233:N,CA,C,O,CB
NEWMODEL:GLY 262:N,CA,C,O
NEWMODEL:CA E282H:CA



Example 2

Suitable substitutions in Savinase® for addition of amino
attachment groups (-NH2)

The known X-ray structure of Savinase® was used to find
where suitable amino attachment groups may be added (Betzel et
al., (1992), J. Mol. Biol. 223, p. 427-445).

The 3D structure of Savinase® is available in the Brookhaven
Databank as 1svn.pdb. A related subtilisin is available as
1st3.pdb.

The sequence of Savinase® is shown in SEQ ID NO. 3.
The sequence numbering used is that of subtilisin BPN',
Savinase® having deletions relative to BPN' at positions 36,
56, 158-159 and 163-164. The active site residues (functional
site) are D32, H64 and S221.
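The relation between BPN' numbering and a consecutive count of the Savinase® residues can be sketched as follows. This is only an illustration that assumes the deletion positions stated above; the function names are hypothetical and are not part of the original procedure.

```python
# Positions of subtilisin BPN' that are absent in Savinase (taken from the text).
DELETED_BPN_POSITIONS = {36, 56, 158, 159, 163, 164}


def sequential_to_bpn(seq_index: int) -> int:
    """Map a consecutive (1-based) Savinase residue index to BPN' numbering."""
    if seq_index < 1:
        raise ValueError("residue indices are 1-based")
    bpn = 0
    seen = 0
    # Walk along the BPN' positions, counting only those present in Savinase.
    while seen < seq_index:
        bpn += 1
        if bpn not in DELETED_BPN_POSITIONS:
            seen += 1
    return bpn


def bpn_to_sequential(bpn_position: int):
    """Inverse mapping; returns None for a position deleted in Savinase."""
    if bpn_position in DELETED_BPN_POSITIONS:
        return None
    skipped = sum(1 for d in DELETED_BPN_POSITIONS if d < bpn_position)
    return bpn_position - skipped
```

Under these assumptions the active site residues D32, H64 and S221 correspond to consecutive positions 32, 62 and 215.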

The commands performed in Insight (BIOSYM) are shown in the
command files makeKzone.bcl and makeKzone2.bcl below:



Conservative substitutions:
makeKzone.bcl
Delete Subset *
Color Molecule Atoms * Specified Specification 255,0,255
Zone Subset LYS :lys:NZ Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset NTERM :e1:N Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITE residues according to the protein
Zone Subset ACTSITE :e32,e64,e221 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONE Union LYS NTERM
Combine Subset ALLZONE Union ALLZONE ACTSITE
#NOTE: editnextline object name according to the protein
Combine Subset REST Difference SAVI8 ALLZONE
List Subset REST Atom Output_File restatom.list
List Subset REST monomer/residue Output_File restmole.list
Color Molecule Atoms ACTSITE Specified Specification 255,0,0
List Subset ACTSITE Atom Output_File actsiteatom.list
List Subset ACTSITE monomer/residue Output_File actsitemole.list
#
Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset
Combine Subset SUB5A Difference REST5A ACTSITE
Combine Subset SUB5B Difference SUB5A REST
Color Molecule Atoms SUB5B Specified Specification 255,255,255
List Subset SUB5B Atom Output_File sub5batom.list
List Subset SUB5B monomer/residue Output_File sub5bmole.list
#Now identify sites for lys->arg substitutions and continue with makezone2.bcl
#Use grep command to identify ARG in restatom.list, sub5batom.list & actsiteatom.list
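The subset logic of the command file can be paraphrased outside Insight as plain set operations. The sketch below is not the Insight implementation: it assumes that a zone selects every residue having at least one atom within the given radius (in Å) of any reference atom, and the names `zone` and `partition` and the input layout (a list of residue identifier and coordinate pairs) are hypothetical.

```python
import math


def zone(atoms, centers, radius):
    """Residues with at least one atom within `radius` of any center coordinate."""
    selected = set()
    for residue, coord in atoms:
        if any(math.dist(coord, center) <= radius for center in centers):
            selected.add(residue)
    return selected


def partition(atoms, lys_nz, nterm_n, active_site_atoms):
    """Rebuild the REST, SUB5B and ACTSITE subsets from atom coordinates."""
    all_residues = {residue for residue, _ in atoms}
    lys = zone(atoms, lys_nz, 10.0)                # 10 Å around lysine NZ atoms
    nterm = zone(atoms, nterm_n, 10.0)             # 10 Å around the N-terminal N
    actsite = zone(atoms, active_site_atoms, 8.0)  # 8 Å around active site atoms
    allzone = lys | nterm | actsite
    rest = all_residues - allzone                  # REST = object minus ALLZONE
    rest_coords = [coord for residue, coord in atoms if residue in rest]
    rest5a = zone(atoms, rest_coords, 5.0)         # 5 Å shell around REST
    sub5a = rest5a - actsite
    sub5b = sub5a - rest                           # shell residues that are not in REST
    return {"REST": rest, "SUB5B": sub5b, "ACTSITE": actsite}
```

With real data the coordinates would come from the structure file; here they are left to the caller so that the sketch stays independent of any file parser.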

Comments:

In the case of Savinase®, REST contains the arginines Arg10,
Arg170 and Arg186, and SUB5B contains Arg19, Arg45, Arg145 and
Arg247.
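The grep step mentioned in the command file can be imitated with a short script. The sketch below assumes list lines of the form `OBJECT:RES NUMBER:atoms` (for example `SAVI8:ARG E10:N,CA,...`); the regular expression and the function names are illustrative assumptions, not part of the original workflow.

```python
import re

# object name, residue name, optional chain letter, residue number
LIST_LINE = re.compile(r"^\s*\w+\s*:\s*([A-Za-z]{3})\s+[A-Za-z]?(\d+)[A-Za-z]?\s*:")


def residues_of_type(lines, residue_name):
    """Return the residue numbers of all list lines naming `residue_name`."""
    numbers = []
    for line in lines:
        match = LIST_LINE.match(line)
        if match and match.group(1).upper() == residue_name.upper():
            numbers.append(int(match.group(2)))
    return numbers


def arg_to_lys_labels(lines):
    """Build substitution labels such as R10K for every arginine found."""
    return [f"R{number}K" for number in residues_of_type(lines, "ARG")]
```

Applied to the REST atom list of Savinase®, a filter of this kind would be expected to report the arginines named above.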

These residues are all solvent exposed. The substitutions
R10K, R19K, R45K, R145K, R170K, R186K and R247K are identified
in Savinase® as sites for mutagenesis within the scope of this
invention. The residues are substituted below in section 2,
and further analysis is done. The subset ACTSITE contains Lys94.
The substitution K94R is a mutation removing lysine as an
attachment group close to the active site.



Non-conservative substitutions:
makeKzone2.bcl
#sourcefile makezone2.bcl Claus von der Osten 961128
#
#having scanned lists (grep arg command) and identified sites for lys->arg substitutions
#NOTE: editnextline object name according to protein
Copy Object -To_Clipboard -Displace SAVI8 newmodel
Biopolymer
#NOTE: editnextline object name according to protein
Blank Object On SAVI8
#NOTE: editnextlines with lys->arg positions
Replace Residue newmodel:e10 lys L
Replace Residue newmodel:e170 lys L
Replace Residue newmodel:e186 lys L
Replace Residue newmodel:e19 lys L
Replace Residue newmodel:e45 lys L
Replace Residue newmodel:e145 lys L
Replace Residue newmodel:e247 lys L
#
#Now repeat analysis done prior to arg->lys, now including introduced lysines
Color Molecule Atoms newmodel Specified Specification 255,0,255
Zone Subset LYSx newmodel:lys:NZ Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset NTERMx newmodel:e1:N Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITEx residues according to the protein
Zone Subset ACTSITEx newmodel:e32,e64,e221 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONEx Union LYSx NTERMx
Combine Subset ALLZONEx Union ALLZONEx ACTSITEx
Combine Subset RESTx Difference newmodel ALLZONEx
List Subset RESTx Atom Output_File restxatom.list
List Subset RESTx monomer/residue Output_File restxmole.list
#
Color Molecule Atoms ACTSITEx Specified Specification 255,0,0
List Subset ACTSITEx Atom Output_File actsitexatom.list
List Subset ACTSITEx monomer/residue Output_File actsitexmole.list
#
#read restxatom.list or restxmole.list to identify sites for (not_arg)->lys subst. if needed
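The residue replacement step can be mirrored at the sequence level as a simple bookkeeping aid. The sketch below is an assumption-laden illustration, not a substitute for the modelling done in Insight: it treats the protein as a hypothetical mapping from residue number to one-letter code and applies labels such as R10K after checking the wild-type residue.

```python
import re

SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def apply_substitutions(residues, substitutions):
    """Return a copy of `residues` (number -> one-letter code) with labels applied."""
    mutated = dict(residues)
    for label in substitutions:
        match = SUBSTITUTION.match(label)
        if match is None:
            raise ValueError(f"cannot parse substitution {label!r}")
        old, position, new = match.group(1), int(match.group(2)), match.group(3)
        # Refuse to mutate when the stated wild-type residue does not match.
        if mutated.get(position) != old:
            raise ValueError(f"position {position} is not {old}")
        mutated[position] = new
    return mutated


def lysine_positions(residues):
    """Sorted residue numbers that carry a lysine."""
    return sorted(number for number, code in residues.items() if code == "K")
```

For a fragment such as `{10: "R", 19: "R", 27: "K"}`, applying R10K and R19K yields lysines at 10, 19 and 27, which is the set of positions the repeated zone analysis would then start from.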

25 

Comments:

Of the residues in RESTx, the following are >5% exposed (see
lists below): 5, 14, 22, 38-40, 42, 75-76, 82, 86, 103-105, 108,
133-135, 137, 140, 173, 204, 206, 211-213, 215-216, 269. The following
mutations are proposed in Savinase®: P5K, P14K, T22K, T38K,
H39K, P40K, L42K, L75K, N76K, L82K, P86K, S103K, V104K, S105K,
A108K, A133K, T134K, L135K, Q137K, N140K, N173K, N204K, Q206K,
G211K, S212K, T213K, A215K, S216K, N269K.
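The exposure criterion can be illustrated with a small filter. The text does not state which reference area the 5% refers to, so the sketch below assumes a per-residue-type reference value supplied by the caller; the reference numbers in the usage example are placeholders chosen for illustration only, and the function name is hypothetical.

```python
# One-letter codes for the residue types used in the example (extend as needed).
THREE_TO_ONE = {"ALA": "A", "ILE": "I", "PRO": "P"}


def lysine_candidates(areas, reference_areas, min_fraction=0.05):
    """Return X->K labels for residues whose relative exposure exceeds `min_fraction`.

    `areas` holds (residue name, residue number, accessible area) tuples and
    `reference_areas` maps a residue name to the area taken as 100% exposure.
    """
    labels = []
    for name, number, area in areas:
        fraction = area / reference_areas[name]
        if fraction > min_fraction:
            labels.append(f"{THREE_TO_ONE[name]}{number}K")
    return labels
```

For example, with the accessibility values listed for PRO 5, ILE 8, ALA 13 and PRO 14 and placeholder reference areas of 140, 180 and 110 for PRO, ILE and ALA, only P5K and P14K pass the filter. The sketch applies the exposure test alone; membership of RESTx still has to be checked separately.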

Relevant data for Example 2:

Solvent accessibility data for SAVINASE®:

# SAVI8NOH2O Fri Nov 29 13:32:07 MET 1996
# residue  area
ALA_1      118.362808
GLN_2      49.422764
SER_3      61.982887
VAL_4      71.620255
PRO_5      21.737535
TRP_6      58.718731
GLY_7      4.328117
ILE_8      6.664074
SER_9      60.175900
ARG_10     70.928963
VAL_11     2.686934
GLN_12     72.839996
ALA_13     0.000000
PRO_14     52.308453
ALA_15     38.300892
ALA_16     0.000000
HIS_17     41.826324
ASN_18     136.376602
ARG_19     105.678642
GLY_20     48.231510
LEU_21     17.196377
THR_22     36.781742
GLY_23     0.000000
SER_24     64.151276
GLY_25     50.269905
VAL_26     4.030401
LYS_27     54.239555
VAL_28     0.000000
ALA_29     0.000000
VAL_30     3.572827
LEU_31     0.233495
ASP_32     1.074774
THR_33     1.973557
GLY_34     3.638052
ILE_35     8.044439
SER_36     8.514903
THR_37     122.598907
HIS_38     18.834011
PRO_39     76.570526
ASP_40     0.000000
LEU_41     19.684013
ASN_42     88.870216
ILE_43     56.117710
ARG_44     110.647194
GLY_45     26.935413
GLY_46     35.515778
ALA_47     21.495472
SER_48     34.876190
PHE_49     52.647541
VAL_50     23.364208
PRO_51     110.408752
GLY_52     80.282906
GLU_53     43.033707
PRO_54     124.444336
SER_55     60.284889
THR_56     47.103241
GLN_57     120.803505
ASP_58     12.784743
GLY_59     61.742443
ASN_60     56.760231
GLY_61     1.576962
HIS_62     38.590118
GLY_63     0.000000
THR_64     0.537387
HIS_65     0.968253
VAL_66     1.612160
ALA_67     0.000000
GLY_68     2.801945
THR_69     9.074596
ILE_70     0.000000
ALA_71     4.577205
ALA_72     0.000000
LEU_73     47.290039

ASN_74     10? 1R7248
ASN_75     60 ?1 OAOO
SER_76     OA 614494
ILE_77     66.098572
GLY_78     17.979534
VAL_79     5.642561
LEU_80     1*1 025185
GLY_81     0.000000
VAL_82     n 3£ft£Qi
ALA_83     0.000000
PRO_84     ID 1 Q*1Q1 n
SER_85     Cfi Q'lQn'lQ
ALA_86
GLU_87     77 O! 1 76*5
LEU_88     2 1
TYR_89     JU» g j J JIO
ALA_90     1 7A7A67
VAL_91     n 77QA50
LYS_92     R ft 62 7ft 1
VAL_93     fl A66QQ1
LEU_94     1 fl 7477 7 6
GLY_95     ft 7 07 1 09
ALA_96     A1 41 4677
SER_97     Q6 066040
GLY_98     -5*1 7 7 44 85


SER_99     67.664116
GLY_100    35.571117
SER_101    54.096992
VAL_102    52.695324
SER_103    62.929684
SER_104    8.683097
ILE_105    15.852910
ALA_106    14.509443
GLN_107    94.463066
GLY_108    0.000000
LEU_109    0.537387
GLU_110    63.227707
TRP_111    55.500740
ALA_112    0.502189
GLY_113    11.908267
ASN_114    107.208527
ASN_115    78.811234
GLY_116    41.453194
MET_117    9.634291
HIS_118    54.022118
VAL_119    5.105174
ALA_120    0.268693
ASN_121    0.233495
LEU_122    0.537387
SER_123    4.004620
LEU_124    21.927265
GLY_125    55.952454
SER_126    40.241180
PRO_127    107.409439
SER_128    57.988609
PRO_129    85.021118

SER_130    20.460915
ALA_131    57.404362
THR_132    74.438805
LEU_133    12.091203
GLU_134    73.382019
GLN_135    114.870010
ALA_136    2.122917
VAL_137    1.074774
ASN_138    55.622704
SER_139    29.174965
ALA_140    0.268693
THR_141    27.962946
SER_142    87.263145
ARG_143    88.201218
GLY_144    38.477882
VAL_145    2.079151
LEU_146    13.703363
VAL_147    2.690253
VAL_148    1.074774
ALA_149    0.000000
ALA_150    4.356600
SER_151    0.000000
GLY_152    12.628590
ASN_153    84.248703
SER_154    77.662354
GLY_155    25.409861
ALA_156    38.074570
GLY_157    40.493744
SER_158    53.915291
ILE_159    4.352278
SER_160    12.458543
TYR_161    29.670284
PRO_162    4.030401
ALA_163    0.968253
ARG_164    84.059120
TYR_165    28.641129
ALA_166    68.193314
ASN_167    61.686481
ALA_168    0.537387
MET_169    0.586837
ALA_170    0.000000
VAL_171    0.000000
GLY_172    0.000000
ALA_173    0.933982
THR_174    3.013133
ASP_175    34.551376
GLN_176    96.873039
ASN_177    98.664368
ASN_178    41.197159
ASN_179    60.263512
ARG_180    64.416336
ALA_181    7.254722
SER_182    91.590881
PHE_183    52.126518
SER_184    2.101459
GLN_185    15.736279
TYR_186    44.287792

GLY_187    5.114592
ALA_188    69.406563
GLY_189    36.926083
LEU_190    16.511177
ASP_191    7.705349
ILE_192    0.268693
VAL_193    4.299094
ALA_194    0.000000
PRO_195    0.806080
GLY_196    0.000000
VAL_197    25.257177
ASN_198    82.177422
VAL_199    10.747736
GLN_200    80.374527
SER_201    2.008755
THR_202    0.000000
TYR_203    80.679886
PRO_204    34.632195
GLY_205    74.536827
SER_206    74.964920
THR_207    57.070065
TYR_208    82.895500
ALA_209    22.838940
SER_210    69.045639
LEU_211    49.708279
ASN_212    86.905457
GLY_213    2.686934
THR_214    4.669909
SER_215    15.225292
MET_216    7.261287
ALA_217    0.000000
THR_218    0.000000
PRO_219    0.806080
HIS_220    2.662697
VAL_221    0.268693
ALA_222    0.000000
GLY_223    0.000000
ALA_224    7.206634
ALA_225    1.039576
ALA_226    0.268693
LEU_227    1.074774
VAL_228    1.541764
LYS_229    39.262505
GLN_230    54.501614
LYS_231    81.154129
ASN_232    30.004124
PRO_233    91.917931
SER_234    102.856705
TRP_235    64.639481
SER_236    51.797619
ASN_237    24.866917
VAL_238    78.458466
GLN_239    73.981461
ILE_240    14.474245
ARG_241    41.242931
ASN_242    64.644814
HIS_243    50.671440

LEU_244    5.127482
LYS_245    48.820000
ASN_246    115.264534
THR_247    22.205376
ALA_248    16.415077
THR_249    60.503101
SER_250    74.511597
LEU_251    48.861599
GLY_252    39.124340
SER_253    49.811481
THR_254    88.421982
ASN_255    72.490181
LEU_256    54.835758
TYR_257    38.798912
GLY_258    3.620916
SER_259    35.017368
GLY_260    0.537387
LEU_261    8.598188
VAL_262    4.519700
ASN_263    16.763659
ALA_264    3.413124
GLU_265    37.942276
ALA_266    15.871746
ALA_267    3.947115
THR_268    2.475746
ARG_269    176.743362
ION_270    0.000000
ION_271    5.197493

Subset REST:

restmole.list
Subset REST:
SAVI8:E5-E15,E17-E18,E22,E38-E40,E42-E43,E73-E76,E82-E86,E103-E105,
SAVI8:E108-E109,E111-E112,E115-E116,E122,E128-E144,E149-E150,E156-E157,
SAVI8:E160-E162,E165-E168,E170-E171,E173,E180-E188,E190-E192,E200,
SAVI8:E203-E204,E206,E211-E213,E215-E216,E227-E230,E255-E259,E261-E262,
SAVI8:E267-E269

restatom.list
Subset REST:
SAVI8:PRO E5:N,CD,CA,CG,CB,C,O
SAVI8:TRP E6:N,CA,CD2,CE2,NE1,CD1,CG,CE3,CZ3,CH2,CZ2,CB,C,O
SAVI8:GLY E7:N,CA,C,O
SAVI8:ILE E8:N,CA,CD1,CG1,CB,CG2,C,O
SAVI8:SER E9:N,CA,OG,CB,C,O
SAVI8:ARG E10:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,O
SAVI8:VAL E11:N,CA,CG2,CG1,CB,C,O
SAVI8:GLN E12:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:ALA E13:N,CA,CB,C,O
SAVI8:PRO E14:N,CD,CA,CG,CB,C,O
SAVI8:ALA E15:N,CA,CB,C,O
SAVI8:HIS E17:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,O
SAVI8:ASN E18:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:THR E22:N,CA,CG2,OG1,CB,C,O
SAVI8:THR E38:N,CA,CG2,OG1,CB,C,O
SAVI8:HIS E39:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,O
SAVI8:PRO E40:N,CD,CA,CG,CB,C,O
SAVI8:LEU E42:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:ASN E43:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:ALA E73:N,CA,CB,C,O
SAVI8:ALA E74:N,CA,CB,C,O
SAVI8:LEU E75:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:ASN E76:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:LEU E82:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:GLY E83:N,CA,C,O
SAVI8:VAL E84:N,CA,CG2,CG1,CB,C,O
SAVI8:ALA E85:N,CA,CB,C,O
SAVI8:PRO E86:N,CD,CA,CG,CB,C,O
SAVI8:SER E103:N,CA,OG,CB,C,O
SAVI8:VAL E104:N,CA,CG2,CG1,CB,C,O
SAVI8:SER E105:N,CA,OG,CB,C,O
SAVI8:ALA E108:N,CA,CB,C,O
SAVI8:GLN E109:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:LEU E111:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:GLU E112:N,CA,OE2,OE1,CD,CG,CB,C,O
SAVI8:GLY E115:N,CA,C,O
SAVI8:ASN E116:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:ALA E122:N,CA,CB,C,O
SAVI8:SER E128:N,CA,OG,CB,C,O
SAVI8:PRO E129:N,CD,CA,CG,CB,C,O
SAVI8:SER E130:N,CA,OG,CB,C,O
SAVI8:PRO E131:N,CD,CA,CG,CB,C,O
SAVI8:SER E132:N,CA,OG,CB,C,O
SAVI8:ALA E133:N,CA,CB,C,O
SAVI8:THR E134:N,CA,CG2,OG1,CB,C,O
SAVI8:LEU E135:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:GLU E136:N,CA,OE2,OE1,CD,CG,CB,C,O
SAVI8:GLN E137:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:ALA E138:N,CA,CB,C,O
SAVI8:VAL E139:N,CA,CG2,CG1,CB,C,O
SAVI8:ASN E140:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:SER E141:N,CA,OG,CB,C,O
SAVI8:ALA E142:N,CA,CB,C,O
SAVI8:THR E143:N,CA,CG2,OG1,CB,C,O
SAVI8:SER E144:N,CA,OG,CB,C,O
SAVI8:VAL E149:N,CA,CG2,CG1,CB,C,O
SAVI8:VAL E150:N,CA,CG2,CG1,CB,C,O
SAVI8:SER E156:N,CA,OG,CB,C,O
SAVI8:GLY E157:N,CA,C,O
SAVI8:ALA E160:N,CA,CB,C,O
SAVI8:GLY E161:N,CA,C,O
SAVI8:SER E162:N,CA,OG,CB,C,O
SAVI8:ILE E165:N,CA,CD1,CG1,CB,CG2,C,O
SAVI8:SER E166:N,CA,OG,CB,C,O
SAVI8:TYR E167:N,CA,OH,CZ,CD2,CE2,CE1,CD1,CG,CB,C,O
SAVI8:PRO E168:N,CD,CA,CG,CB,C,O
SAVI8:ARG E170:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,O
SAVI8:TYR E171:N,CA,OH,CZ,CD2,CE2,CE1,CD1,CG,CB,C,O
SAVI8:ASN E173:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:THR E180:N,CA,CG2,OG1,CB,C,O
SAVI8:ASP E181:N,CA,OD2,OD1,CG,CB,C,O
SAVI8:GLN E182:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:ASN E183:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:ASN E184:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:ASN E185:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:ARG E186:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,O
SAVI8:ALA E187:N,CA,CB,C,O
SAVI8:SER E188:N,CA,OG,CB,C,O
SAVI8:SER E190:N,CA,OG,CB,C,O
SAVI8:GLN E191:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:TYR E192:N,CA,OH,CZ,CD2,CE2,CE1,CD1,CG,CB,C,O
SAVI8:ALA E200:N,CA,CB,C,O
SAVI8:VAL E203:N,CA,CG2,CG1,CB,C,O
SAVI8:ASN E204:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:GLN E206:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:GLY E211:N,CA,C,O
SAVI8:SER E212:N,CA,OG,CB,C,O
SAVI8:THR E213:N,CA,CG2,OG1,CB,C,O
SAVI8:ALA E215:N,CA,CB,C,O
SAVI8:SER E216:N,CA,OG,CB,C,O
SAVI8:VAL E227:N,CA,CG2,CG1,CB,C,O
SAVI8:ALA E228:N,CA,CB,C,O
SAVI8:GLY E229:N,CA,C,O
SAVI8:ALA E230:N,CA,CB,C,O
SAVI8:THR E255:N,CA,CG2,OG1,CB,C,O
SAVI8:SER E256:N,CA,OG,CB,C,O
SAVI8:LEU E257:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:GLY E258:N,CA,C,O
SAVI8:SER E259:N,CA,OG,CB,C,O
SAVI8:ASN E261:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:LEU E262:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:LEU E267:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:VAL E268:N,CA,CG2,CG1,CB,C,O
SAVI8:ASN E269:N,CA,ND2,OD1,CG,CB,C,O



Subset SUB5B:

sub5bmole.list
Subset SUB5B:
SAVI8:E2-E4,E16,E19-E21,E23-E24,E28,E37,E41,E44-E45,E77-E81,E87-E88,
SAVI8:E90,E113-E114,E117-E118,E120-E121,E145-E148,E169,E172,E174-E176,
SAVI8:E193-E196,E198-E199,E214,E231-E234,E236,E243,E247,E250,E253-E254,
SAVI8:E260,E263-E266,E270-E273,M276H-M277H

sub5batom.list
Subset SUB5B:
SAVI8:GLN E2:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:SER E3:N,CA,OG,CB,C,O
SAVI8:VAL E4:N,CA,CG2,CG1,CB,C,O
SAVI8:ALA E16:N,CA,CB,C,O
SAVI8:ARG E19:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,O
SAVI8:GLY E20:N,CA,C,O
SAVI8:LEU E21:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:GLY E23:N,CA,C,O
SAVI8:SER E24:N,CA,OG,CB,C,O
SAVI8:VAL E28:N,CA,CG2,CG1,CB,C,O
SAVI8:SER E37:N,CA,OG,CB,C,O
SAVI8:ASP E41:N,CA,OD2,OD1,CG,CB,C,O
SAVI8:ILE E44:N,CA,CD1,CG1,CB,CG2,C,O
SAVI8:ARG E45:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,O
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 
SAVI8 



ASN E77:N,CA # ND2,0D1,CG,CB,C,0 

SER E78:N,CA,0G,CB,C,0 

ILE E79:N,CA, 001,001,08,002,0,0 

GLY E80:N,CA,C,O 

VAL E81:N,CA,CG2,CG1,CB,C,0 

SER E87:N,CA,OG,CB,C,0 

ALA E88:N,CA,CB,0,0 

LEU E90:N, CA, CD2 , CD1, CG, CB, 0,0 



TRP E113 
ALA E114 
ASN E117 
GLY El 18 
HIS E120 
VAL E121 
ARG E145 
GLY El 4 6 
VAL EI47 
LEU E148 
ALA E169 
ALA E172 
ALA E174 
MET E175 
ALA E176 
GLY E193 
ALA E194 
GLY El 9 5 
LEU El 9 6 
ILE E198 
VAL E199 
TYR E214 
ALA E231 
ALA E232 
LEU E233 
VAL E234 
GLN E236 
E243 
E247 
E250 
E253 



ASN 
ARG 
LEU 
THR 



ALA E254 
THR E260 



:N 


,CA, 


:N 


,CA, 


:N 


rCA, 


:N 


rCA, 


:N 


, OA , 


:N 


rCA, 


:N 


rCA, 


:N 


rCA, 


:N 


rCA, 


:N 


rCA, 


:N 


OA, 


:N 


rCA, 


:N 


rCA, 


:N 


rCA, 


:N 


f CA, 


:N 


,CA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N, 


OA, 


:N 


OA, 


:N, 


OA, 


:N, 


OA, 


:N i 


OA, 


:N 


OA, 


:N 


OA, 


:N, 


OA, 


:N 


OA, 


:N, 


OA, 



, CD2 , CE2 , NE1 , CD1, CG, CE3 , CZ3 , CH2 , CZ2 , CB , C, O 
,CB,C f O 

,ND2,0D1,CG,CB,C,0 
,C,0 

, CD2 , NE2 , CE1 , ND 1 , CG , CB , C , O 
,CG2,CG1,CB,C,0 
,NH2,NH1,CZ,NE,CD,CG,CB,C,0 

,c,o 

,CG2,CG1,0B,C,0 
,CD2,CD1,CG,CB,C,0 
,CB, 0,0 
,CB,C,0 
,CB,C,0 

,CE,SD, 00,06,0,0 
,CB,0,O 
,0,0 
,CB,C,0 
,0,O 

,CD2,CD1,CG,CB,C,0 
, CD1 , CGI , CB , CG2 , C , O 
,CG2,CG1,CB,C,0 

, OH , CZ , CD2 , CE2 , CE1 , CD1 , CG , CB , C , O 
,CB,C,0 
,CB,C,0 

,CD2,CD1,CG,CB,C,0 
,CG2,CG1,CB,C,0 
,NE2,0E1,CD,CG,CB,C,0 
,ND2,0D1,CG, CB,C,0 
,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
,CD2,CD1,CG,CB,C,0 
,CG2,0G1,CB,C,0 
,CB,C,0 

,CG2,0G1,CB,C,0 

,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 
,0,0 

,0G,CB,C,0 
,C,0 
,CB,C,0 

,0E2,0E1,CD,CG,CB,C,0 
,CB,C,0 
,CB,C,0 



TYR E263: 
GLY E264: 
SER E265: 
GLY E266: 
ALA E270: 
GLU E271: 
ALA E272: 
ALA E273: 
:I0N M276H:CA 
:I0N M277H:CA 
Subset ACTSITE:
actsitemole.list
Subset ACTSITE:
SAVI8:E29-E35,E48-E51,E54,E58-E72,E91-E102,E106-E107,E110,E123-E127,
SAVI8:E151-E155,E177-E179,E189,E201-E202,E205,E207-E210,E217-E226
actsiteatom.list
Subset ACTSITE:
SAVI8:ALA E29:N,CA,CB,C,O
SAVI8:VAL E30:N,CA,CG2,CG1,CB,C,O
SAVI8:LEU E31:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:ASP E32:N,CA,OD2,OD1,CG,CB,C,O
SAVI8:THR E33:N,CA,CG2,OG1,CB,C,O
SAVI8:GLY E34:N,CA,C,O
SAVI8:ILE E35:N,CA,CD1,CG1,CB,CG2,C,O
SAVI8:ALA E48:N,CA,CB,C,O
SAVI8:SER E49:N,CA,OG,CB,C,O
SAVI8:PHE E50:N,CA,CD2,CE2,CZ,CE1,CD1,CG,CB,C,O
SAVI8:VAL E51:N,CA,CG2,CG1,CB,C,O
SAVI8:GLU E54:N,CA,OE2,OE1,CD,CG,CB,C,O
SAVI8:THR E58:N,CA,CG2,OG1,CB,C,O
SAVI8:GLN E59:N,CA,NE2,OE1,CD,CG,CB,C,O
SAVI8:ASP E60:N,CA,OD2,OD1,CG,CB,C,O
SAVI8:GLY E61:N,CA,C,O
SAVI8:ASN E62:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:GLY E63:N,CA,C,O
SAVI8:HIS E64:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,O
SAVI8:GLY E65:N,CA,C,O
SAVI8:THR E66:N,CA,CG2,OG1,CB,C,O
SAVI8:HIS E67:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,O
SAVI8:VAL E68:N,CA,CG2,CG1,CB,C,O
SAVI8:ALA E69:N,CA,CB,C,O
SAVI8:GLY E70:N,CA,C,O
SAVI8:THR E71:N,CA,CG2,OG1,CB,C,O
SAVI8:ILE E72:N,CA,CD1,CG1,CB,CG2,C,O
SAVI8:TYR E91:N,CA,OH,CZ,CD2,CE2,CE1,CD1,CG,CB,C,O
SAVI8:ALA E92:N,CA,CB,C,O
SAVI8:VAL E93:N,CA,CG2,CG1,CB,C,O
SAVI8:LYS E94:N,CA,NZ,CE,CD,CG,CB,C,O
SAVI8:VAL E95:N,CA,CG2,CG1,CB,C,O
SAVI8:LEU E96:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:GLY E97:N,CA,C,O
SAVI8:ALA E98:N,CA,CB,C,O
SAVI8:SER E99:N,CA,OG,CB,C,O
SAVI8:GLY E100:N,CA,C,O
SAVI8:SER E101:N,CA,OG,CB,C,O
SAVI8:GLY E102:N,CA,C,O
SAVI8:SER E106:N,CA,OG,CB,C,O
SAVI8:ILE E107:N,CA,CD1,CG1,CB,CG2,C,O
SAVI8:GLY E110:N,CA,C,O
SAVI8:ASN E123:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:LEU E124:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:SER E125:N,CA,OG,CB,C,O
SAVI8:LEU E126:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:GLY E127:N,CA,C,O
SAVI8:ALA E151:N,CA,CB,C,O
SAVI8:ALA E152:N,CA,CB,C,O
SAVI8:SER E153:N,CA,OG,CB,C,O
SAVI8:GLY E154:N,CA,C,O
SAVI8:ASN E155:N,CA,ND2,OD1,CG,CB,C,O










SAVI8:VAL E177:N,CA,CG2,CG1,CB,C,O
SAVI8:GLY E178:N,CA,C,O
SAVI8:ALA E179:N,CA,CB,C,O
SAVI8:PHE E189:N,CA,CD2,CE2,CZ,CE1,CD1,CG,CB,C,O
SAVI8:PRO E201:N,CD,CA,CG,CB,C,O
SAVI8:GLY E202:N,CA,C,O
SAVI8:VAL E205:N,CA,CG2,CG1,CB,C,O
SAVI8:SER E207:N,CA,OG,CB,C,O
SAVI8:THR E208:N,CA,CG2,OG1,CB,C,O
SAVI8:TYR E209:N,CA,OH,CZ,CD2,CE2,CE1,CD1,CG,CB,C,O
SAVI8:PRO E210:N,CD,CA,CG,CB,C,O
SAVI8:LEU E217:N,CA,CD2,CD1,CG,CB,C,O
SAVI8:ASN E218:N,CA,ND2,OD1,CG,CB,C,O
SAVI8:GLY E219:N,CA,C,O
SAVI8:THR E220:N,CA,CG2,OG1,CB,C,O
SAVI8:SER E221:N,CA,OG,CB,C,O
SAVI8:MET E222:N,CA,CE,SD,CG,CB,C,O
SAVI8:ALA E223:N,CA,CB,C,O
SAVI8:THR E224:N,CA,CG2,OG1,CB,C,O
SAVI8:PRO E225:N,CD,CA,CG,CB,C,O
SAVI8:HIS E226:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,O
Subset RESTx:
restxmole.list
Subset RESTx:
NEWMODEL:E5,E13-E14,E22,E38-E40,E42,E73-E76,E82-E86,E103-E105,
NEWMODEL:E108,E122,E133-E135,E137-E140,E149-E150,E173,E204,E206,
NEWMODEL:E211-E213,E215-E216,E227-E229,E258,E269
restxatom.list
Subset RESTx:
NEWMODEL:PRO E5:N,CD,CA,CG,CB,C,O
NEWMODEL:ALA E13:N,CA,CB,C,O
NEWMODEL:PRO E14:N,CD,CA,CG,CB,C,O
NEWMODEL:THR E22:N,CA,CG2,OG1,CB,C,O
NEWMODEL:THR E38:N,CA,CG2,OG1,CB,C,O
NEWMODEL:HIS E39:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,O
NEWMODEL:PRO E40:N,CD,CA,CG,CB,C,O
NEWMODEL:LEU E42:N,CA,CD2,CD1,CG,CB,C,O
NEWMODEL:ALA E73:N,CA,CB,C,O
NEWMODEL:ALA E74:N,CA,CB,C,O
NEWMODEL:LEU E75:N,CA,CD2,CD1,CG,CB,C,O
NEWMODEL:ASN E76:N,CA,ND2,OD1,CG,CB,C,O
NEWMODEL:LEU E82:N,CA,CD2,CD1,CG,CB,C,O
NEWMODEL:GLY E83:N,CA,C,O
NEWMODEL:VAL E84:N,CA,CG2,CG1,CB,C,O
NEWMODEL:ALA E85:N,CA,CB,C,O
NEWMODEL:PRO E86:N,CD,CA,CG,CB,C,O
NEWMODEL:SER E103:N,CA,OG,CB,C,O
NEWMODEL:VAL E104:N,CA,CG2,CG1,CB,C,O
NEWMODEL:SER E105:N,CA,OG,CB,C,O
NEWMODEL:ALA E108:N,CA,CB,C,O
NEWMODEL:ALA E122:N,CA,CB,C,O
NEWMODEL:ALA E133:N,CA,CB,C,O
NEWMODEL:THR E134:N,CA,CG2,OG1,CB,C,O
NEWMODEL:LEU E135:N,CA,CD2,CD1,CG,CB,C,O
NEWMODEL:GLN E137:N,CA,NE2,OE1,CD,CG,CB,C,O
NEWMODEL:ALA E138:N,CA,CB,C,O
NEWMODEL:VAL E139:N,CA,CG2,CG1,CB,C,O
NEWMODEL:ASN E140:N,CA,ND2,OD1,CG,CB,C,O
NEWMODEL:VAL E149:N,CA,CG2,CG1,CB,C,O
NEWMODEL:VAL E150:N,CA,CG2,CG1,CB,C,O
NEWMODEL:ASN E173:N,CA,ND2,OD1,CG,CB,C,O
NEWMODEL:ASN E204:N,CA,ND2,OD1,CG,CB,C,O
NEWMODEL:GLN E206:N,CA,NE2,OE1,CD,CG,CB,C,O
NEWMODEL:GLY E211:N,CA,C,O
NEWMODEL:SER E212:N,CA,OG,CB,C,O
NEWMODEL:THR E213:N,CA,CG2,OG1,CB,C,O
NEWMODEL:ALA E215:N,CA,CB,C,O
NEWMODEL:SER E216:N,CA,OG,CB,C,O
NEWMODEL:VAL E227:N,CA,CG2,CG1,CB,C,O
NEWMODEL:ALA E228:N,CA,CB,C,O
NEWMODEL:GLY E229:N,CA,C,O
NEWMODEL:GLY E258:N,CA,C,O
NEWMODEL:ASN E269:N,CA,ND2,OD1,CG,CB,C,O



Example 3

Suitable substitutions in PD498 for addition of carboxylic acid
attachment groups (-COOH)

The 3D structure of PD498 was modeled as described in
Example 1.

Suitable locations for addition of carboxylic attachment groups
(aspartic acids and glutamic acids) were found as follows.
The procedure described in Example 1 was followed. The
commands performed in Insight (BIOSYM) are shown in the command
files makeDEzone.bcl and makeDEzone2.bcl below:



Conservative substitutions:

makeDEzone.bcl

Delete Subset *
Color Molecule Atoms * Specified Specification 255,0,255
Zone Subset ASP :asp:od* Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset GLU :glu:oe* Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline C-terminal residue number according to the protein
Zone Subset CTERM :280:O Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITE residues according to the protein
Zone Subset ACTSITE :39,72,226 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONE Union ASP GLU
Combine Subset ALLZONE Union ALLZONE CTERM
Combine Subset ALLZONE Union ALLZONE ACTSITE
#NOTE: editnextline object name according to the protein
Combine Subset REST Difference PD498FINALMODEL ALLZONE
List Subset REST Atom Output_File restatom.list
List Subset REST monomer/residue Output_File restmole.list
Color Molecule Atoms ACTSITE Specified Specification 255,0,0
List Subset ACTSITE Atom Output_File actsiteatom.list
List Subset ACTSITE monomer/residue Output_File actsitemole.list
#
Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset
Combine Subset SUB5A Difference REST5A ACTSITE
Combine Subset SUB5B Difference SUB5A REST
Color Molecule Atoms SUB5B Specified Specification 255,255,255
List Subset SUB5B Atom Output_File sub5batom.list
List Subset SUB5B monomer/residue Output_File sub5bmole.list
#Now identify sites for asn->asp & gln->glu substitutions and ...
#continue with makezone2.bcl.
#Use grep command to identify asn/gln in restatom.list,
#sub5batom.list & actsiteatom.list
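The grep step named in the closing comment lines of the command file can be sketched in Python as follows. This is only an illustration, not part of the original procedure: the function name `find_amides` and the regular expression are assumptions, and the line format (`OBJECT:RES NUMBER:atoms`) is taken from the atom lists printed in these examples.

```python
import re

# Hypothetical pattern for atom-list lines such as
# "PD498FINALMODEL:GLN 33:N,CA,C,O,CB,CG,CD,OE1,NE2" or "SAVI8:GLN E2:N,CA,...".
LINE_RE = re.compile(
    r"^\s*(?P<obj>[^:\s]+)\s*:\s*(?P<res>[A-Za-z]{3})\s+"
    r"(?P<num>[A-Za-z]*\d+[A-Za-z]*)\s*:"
)

def find_amides(lines):
    """Return (residue name, residue number) for every ASN or GLN entry."""
    hits = []
    for line in lines:
        match = LINE_RE.match(line)
        if match and match.group("res").upper() in {"ASN", "GLN"}:
            hits.append((match.group("res").upper(), match.group("num")))
    return hits

# Three lines in the format of restatom.list (illustrative excerpt).
sample = [
    "PD498FINALMODEL:ALA 10:N,CA,C,O,CB",
    "PD498FINALMODEL:GLN 33:N,CA,C,O,CB,CG,CD,OE1,NE2",
    "PD498FINALMODEL:ASN 245:N,CA,C,O,CB,CG,OD1,ND2",
]
amide_hits = find_amides(sample)
print(amide_hits)
```

Applied to the full restatom.list and sub5batom.list files, such a scan would be expected to return the Asn and Gln positions that the Comments section lists as candidates for substitution.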

Comments:

The subset REST contains Gln33 and Asn245; SUB5B contains
Gln12, Gln126, Asn209, Gln242, Asn246, Gln248 and Asn266, all
of which are solvent exposed.

The substitutions Q12E or Q12D, Q33E or Q33D, Q126E or
Q126D, N209D or N209E, Q242E or Q242D, N245D or N245E, N246D or
N246E, Q248E or Q248D and N266D or N266E are identified in
PD498 as sites for mutagenesis within the scope of this
invention. The residues are substituted below in section 2, and
further analysis is done:



Non-conservative substitutions:

makeDEzone2.bcl

#sourcefile makezone2.bcl Claus von der Osten 961128
# ...
#having scanned lists (grep gln/asn command) and identified sites for ...
#asn->asp & gln->glu substitutions
#NOTE: editnextline object name according to protein
Copy Object -To_Clipboard -Displace PD498FINALMODEL newmodel
Biopolymer
#NOTE: editnextline object name according to protein
Blank Object On PD498FINALMODEL
#NOTE: editnextlines with asn->asp & gln->glu positions
Replace Residue newmodel:33 glu L
Replace Residue newmodel:245 asp L
Replace Residue newmodel:12 glu L
Replace Residue newmodel:126 glu L
Replace Residue newmodel:209 asp L
Replace Residue newmodel:242 glu L
Replace Residue newmodel:246 asp L
Replace Residue newmodel:248 glu L
Replace Residue newmodel:266 asp L
#
#Now repeat analysis done prior to asn->asp & gln->glu, ...
#now including introduced asp & glu
Color Molecule Atoms newmodel Specified Specification 255,0,255
Zone Subset ASPx newmodel:asp:od* Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset GLUx newmodel:glu:oe* Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline C-terminal residue number according to the protein
Zone Subset CTERMx newmodel:280:O Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITEx residues according to the protein
Zone Subset ACTSITEx newmodel:39,72,226 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONEx Union ASPx GLUx
Combine Subset ALLZONEx Union ALLZONEx CTERMx
Combine Subset ALLZONEx Union ALLZONEx ACTSITEx
Combine Subset RESTx Difference newmodel ALLZONEx
List Subset RESTx Atom Output_File restxatom.list
List Subset RESTx monomer/residue Output_File restxmole.list
#
Color Molecule Atoms ACTSITEx Specified Specification 255,0,0
List Subset ACTSITEx Atom Output_File actsitexatom.list
List Subset ACTSITEx monomer/residue Output_File actsitexmole.list
#
#read restxatom.list or restxmole.list to identify sites for (not_gluasp)->gluasp ...
#subst. if needed

Comments:

The subset RESTx contains only two residues: A233 and G234,
neither of which is solvent exposed. No further mutagenesis is
required to obtain complete protection of the surface.
However, it may be necessary to remove some of the reactive
carboxylic groups in the active site region to ensure access to
the active site of PD498. Acidic residues within the subset
ACTSITE are: D39, D58, D68 and D106. Of these, only the two
latter are solvent exposed, and D39 is a functional residue. The
mutations D68N, D68Q, D106N and D106Q were found suitable
according to the present invention.
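The subset logic of the two command files (zones around the carboxyl oxygens, the C-terminus and the active site, followed by a set difference) can be sketched in Python. This is a minimal illustration under stated assumptions, not the Insight implementation: the atom records, the helper names `zone` and `rest_subset`, and the toy coordinates are hypothetical, and the radii 10 and 8 are taken from the Zone commands and assumed to be in Ångström.

```python
from math import dist

def zone(atoms, seed_coords, radius):
    """Residue ids having at least one atom within `radius` of any seed coordinate
    (whole residues are selected, as in the monomer/residue mode)."""
    return {
        atom["res"]
        for atom in atoms
        if any(dist(atom["xyz"], seed) <= radius for seed in seed_coords)
    }

def rest_subset(atoms, carboxyl_coords, actsite_coords,
                carboxyl_radius=10.0, actsite_radius=8.0):
    """Residues covered neither by a carboxyl zone nor by the active-site zone."""
    all_residues = {atom["res"] for atom in atoms}
    allzone = (zone(atoms, carboxyl_coords, carboxyl_radius)
               | zone(atoms, actsite_coords, actsite_radius))
    return all_residues - allzone

# Toy model: four one-atom residues placed along the x axis (hypothetical data).
atoms = [
    {"res": 1, "xyz": (0.0, 0.0, 0.0)},   # carries the carboxyl oxygen
    {"res": 2, "xyz": (5.0, 0.0, 0.0)},   # within 10 of residue 1
    {"res": 3, "xyz": (30.0, 0.0, 0.0)},  # far from every seed
    {"res": 4, "xyz": (50.0, 0.0, 0.0)},  # active-site residue
]
rest = rest_subset(atoms,
                   carboxyl_coords=[(0.0, 0.0, 0.0)],
                   actsite_coords=[(50.0, 0.0, 0.0)])
print(rest)
```

In this toy case only residue 3 remains uncovered, which corresponds to the role of the REST subset: surface regions with no attachment group nearby, where an additional carboxylic group may be introduced.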

Relevant data for Example 3:

Solvent accessibility data for PD498MODEL: see Example 1 above.

Subset REST:
restmole.list
Subset REST:
PD498FINALMODEL:10-11,33-35,54-55,129-130,221,233-234,236,240,243,
PD498FINALMODEL:245,262,264-265
restatom.list
Subset REST:
PD498FINALMODEL:ALA 10:N,CA,C,O,CB
PD498FINALMODEL:TYR 11:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:GLN 33:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:THR 34:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:VAL 35:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ILE 54:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:LYS 55:N,CA,C,O,CB,CG,CD,CE,NZ
PD498FINALMODEL:LYS 129:N,CA,C,O,CB,CG,CD,CE,NZ
PD498FINALMODEL:VAL 130:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:TYR 221:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:ALA 233:N,CA,C,O,CB
PD498FINALMODEL:GLY 234:N,CA,C,O
PD498FINALMODEL:ALA 236:N,CA,C,O,CB
PD498FINALMODEL:ALA 240:N,CA,C,O,CB
PD498FINALMODEL:GLY 243:N,CA,C,O
PD498FINALMODEL:ASN 245:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:GLY 262:N,CA,C,O
PD498FINALMODEL:GLY 264:N,CA,C,O
PD498FINALMODEL:THR 265:N,CA,C,O,CB,OG1,CG2
Subset SUB5B:
sub5bmole.list
Subset SUB5B:
PD498FINALMODEL:6-9,12-13,31-32,51-53,56,81,93-94,97-99,122,126-128,
PD498FINALMODEL:131,155-157,159,197-199,209,211,219-220,232,235,
PD498FINALMODEL:237-239,241-242,244,246-249,253,260-261,263,266-268
sub5batom.list
Subset SUB5B:
PD498FINALMODEL:PRO 6:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:TYR 7:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:TYR 8:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:SER 9:N,CA,C,O,CB,OG
PD498FINALMODEL:GLN 12:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:TYR 13:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:SER 31:N,CA,C,O,CB,OG
PD498FINALMODEL:THR 32:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:ARG 51:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:LYS 52:N,CA,C,O,CB,CG,CD,CE,NZ
PD498FINALMODEL:VAL 53:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:GLY 56:N,CA,C,O
PD498FINALMODEL:ALA 81:N,CA,C,O,CB
PD498FINALMODEL:MET 93:N,CA,C,O,CB,CG,SD,CE
PD498FINALMODEL:ALA 94:N,CA,C,O,CB
PD498FINALMODEL:THR 97:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:LYS 98:N,CA,C,O,CB,CG,CD,CE,NZ
PD498FINALMODEL:ILE 99:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:TYR 122:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:GLN 126:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:GLY 127:N,CA,C,O
PD498FINALMODEL:ALA 128:N,CA,C,O,CB
PD498FINALMODEL:LEU 131:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:GLY 155:N,CA,C,O
PD498FINALMODEL:ALA 156:N,CA,C,O,CB
PD498FINALMODEL:VAL 157:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:VAL 159:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:TYR 197:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:GLY 198:N,CA,C,O
PD498FINALMODEL:THR 199:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:ASN 209:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:ALA 211:N,CA,C,O,CB
PD498FINALMODEL:TYR 219:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:SER 220:N,CA,C,O,CB,OG
PD498FINALMODEL:VAL 232:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:LEU 235:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ALA 237:N,CA,C,O,CB
PD498FINALMODEL:LEU 238:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:LEU 239:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:SER 241:N,CA,C,O,CB,OG
PD498FINALMODEL:GLN 242:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:LYS 244:N,CA,C,O,CB,CG,CD,CE,NZ
PD498FINALMODEL:ASN 246:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:VAL 247:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:GLN 248:N,CA,C,O,CB,CG,CD,OE1,NE2
PD498FINALMODEL:ILE 249:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:ILE 253:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:ILE 260:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:SER 261:N,CA,C,O,CB,OG
PD498FINALMODEL:THR 263:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:ASN 266:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:PHE 267:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
PD498FINALMODEL:LYS 268:N,CA,C,O,CB,CG,CD,CE,NZ
Subset ACTSITE:
actsitemole.list
Subset ACTSITE:
PD498FINALMODEL:36-42,57-60,66-80,100-110,115-116,119,132-136,160-164,
PD498FINALMODEL:182-184,194,206-207,210,212-215,222-231
actsiteatom.list
Subset ACTSITE:
PD498FINALMODEL:ALA 36:N,CA,C,O,CB
PD498FINALMODEL:VAL 37:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:LEU 38:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ASP 39:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:SER 40:N,CA,C,O,CB,OG
PD498FINALMODEL:GLY 41:N,CA,C,O
PD498FINALMODEL:VAL 42:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:TYR 57:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
PD498FINALMODEL:ASP 58:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:PHE 59:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
PD498FINALMODEL:ILE 60:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:PRO 66:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:MET 67:N,CA,C,O,CB,CG,SD,CE
PD498FINALMODEL:ASP 68:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:LEU 69:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ASN 70:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:GLY 71:N,CA,C,O
PD498FINALMODEL:HIS 72:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
PD498FINALMODEL:GLY 73:N,CA,C,O
PD498FINALMODEL:THR 74:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:HIS 75:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
PD498FINALMODEL:VAL 76:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ALA 77:N,CA,C,O,CB
PD498FINALMODEL:GLY 78:N,CA,C,O
PD498FINALMODEL:THR 79:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:VAL 80:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:LEU 100:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ALA 101:N,CA,C,O,CB
PD498FINALMODEL:VAL 102:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:ARG 103:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
PD498FINALMODEL:VAL 104:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:LEU 105:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:ASP 106:N,CA,C,O,CB,CG,OD1,OD2
PD498FINALMODEL:ALA 107:N,CA,C,O,CB
PD498FINALMODEL:ASN 108:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:GLY 109:N,CA,C,O
PD498FINALMODEL:SER 110:N,CA,C,O,CB,OG
PD498FINALMODEL:SER 115:N,CA,C,O,CB,OG
PD498FINALMODEL:ILE 116:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:GLY 119:N,CA,C,O
PD498FINALMODEL:ASN 132:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:LEU 133:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:SER 134:N,CA,C,O,CB,OG
PD498FINALMODEL:LEU 135:N,CA,C,O,CB,CG,CD1,CD2
PD498FINALMODEL:GLY 136:N,CA,C,O
PD498FINALMODEL:ALA 160:N,CA,C,O,CB
PD498FINALMODEL:ALA 161:N,CA,C,O,CB
PD498FINALMODEL:ALA 162:N,CA,C,O,CB
PD498FINALMODEL:GLY 163:N,CA,C,O
PD498FINALMODEL:ASN 164:N,CA,C,O,CB,CG,OD1,ND2
PD498FINALMODEL:VAL 182:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:GLY 183:N,CA,C,O
PD498FINALMODEL:ALA 184:N,CA,C,O,CB
PD498FINALMODEL:PHE 194:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
PD498FINALMODEL:PRO 206:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:GLY 207:N,CA,C,O
PD498FINALMODEL:ILE 210:N,CA,C,O,CB,CG1,CG2,CD1
PD498FINALMODEL:SER 212:N,CA,C,O,CB,OG
PD498FINALMODEL:THR 213:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:VAL 214:N,CA,C,O,CB,CG1,CG2
PD498FINALMODEL:PRO 215:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:MET 222:N,CA,C,O,CB,CG,SD,CE
PD498FINALMODEL:SER 223:N,CA,C,O,CB,OG
PD498FINALMODEL:GLY 224:N,CA,C,O
PD498FINALMODEL:THR 225:N,CA,C,O,CB,OG1,CG2
PD498FINALMODEL:SER 226:N,CA,C,O,CB,OG
PD498FINALMODEL:MET 227:N,CA,C,O,CB,CG,SD,CE
PD498FINALMODEL:ALA 228:N,CA,C,O,CB
PD498FINALMODEL:SER 229:N,CA,C,O,CB,OG
PD498FINALMODEL:PRO 230:N,CA,CD,C,O,CB,CG
PD498FINALMODEL:HIS 231:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
Subset RESTx:
restxmole.list
Subset RESTx:
NEWMODEL:233-234
restxatom.list
Subset RESTx:
NEWMODEL:ALA 233:N,CA,C,O,CB
NEWMODEL:GLY 234:N,CA,C,O

Example 4

Suitable substitutions in the Arthromyces ramosus peroxidase
for addition of carboxylic acid attachment groups (-COOH)

Suitable locations for addition of carboxylic attachment
groups (aspartic acids and glutamic acids) in a non-hydrolytic
enzyme, Arthromyces ramosus peroxidase, were found as
follows.

The 3D structure of this oxidoreductase is available in the
Brookhaven Databank as 1arp.pdb. This A. ramosus peroxidase
contains 344 amino acid residues. The first eight residues,
QGPGGGGG, are not visible in the X-ray structure, and N143 is
glycosylated.

The procedure described in Example 1 was followed.
The amino acid sequence of Arthromyces ramosus peroxidase
(E.C. 1.11.1.7) is shown in SEQ ID NO 4.
The commands performed in Insight (BIOSYM) are shown in the
command files makeDEzone.bcl and makeDEzone2.bcl below. The
C-terminal residue is P344; the ACTSITE is defined as the heme
group and the two histidines coordinating it (H56 and H184).

Conservative substitutions:

makeDEzone.bcl

Delete Subset *
Color Molecule Atoms * Specified Specification 255,0,255
Zone Subset ASP :asp:od* Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset GLU :glu:oe* Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline C-terminal residue number according to the protein
Zone Subset CTERM :344:O Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITE residues according to the protein
Zone Subset ACTSITE :HEM,56,184 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONE Union ASP GLU
Combine Subset ALLZONE Union ALLZONE CTERM
Combine Subset ALLZONE Union ALLZONE ACTSITE
#NOTE: editnextline object name according to the protein
Combine Subset REST Difference ARP ALLZONE
List Subset REST Atom Output_File restatom.list
List Subset REST monomer/residue Output_File restmole.list
Color Molecule Atoms ACTSITE Specified Specification 255,0,0
List Subset ACTSITE Atom Output_File actsiteatom.list
List Subset ACTSITE monomer/residue Output_File actsitemole.list
#
Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset
Combine Subset SUB5A Difference REST5A ACTSITE
Combine Subset SUB5B Difference SUB5A REST
Color Molecule Atoms SUB5B Specified Specification 255,255,255
List Subset SUB5B Atom Output_File sub5batom.list
List Subset SUB5B monomer/residue Output_File sub5bmole.list
#Now identify sites for asn->asp & gln->glu substitutions and ...
#continue with makezone2.bcl.
#Use grep command to identify asn/gln in restatom.list,
#sub5batom.list & actsiteatom.list

Comments:

The subset REST contains Gln70, and SUB5B contains Gln34,
Asn128 and Asn303, all of which are solvent exposed. The
substitutions Q34E or Q34D, Q70E or Q70D, N128D or N128E and
N303D or N303E are identified in A. ramosus peroxidase as sites
for mutagenesis. The residues are substituted below and further
analysis is done:

Non-conservative substitutions:

makeDEzone2.bcl

#sourcefile makezone2.bcl Claus von der Osten 961128
#
#having scanned lists (grep gln/asn command) and identified sites for ...
#asn->asp & gln->glu substitutions
#NOTE: editnextline object name according to protein
Copy Object -To_Clipboard -Displace ARP newmodel
Biopolymer
#NOTE: editnextline object name according to protein
Blank Object On ARP
#NOTE: editnextlines with asn->asp & gln->glu positions
Replace Residue newmodel:34 glu L
Replace Residue newmodel:70 glu L
Replace Residue newmodel:128 asp L
Replace Residue newmodel:303 asp L
#
#Now repeat analysis done prior to asn->asp & gln->glu, ...
#now including introduced asp & glu
Color Molecule Atoms newmodel Specified Specification 255,0,255
Zone Subset ASPx newmodel:asp:od* Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset GLUx newmodel:glu:oe* Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline C-terminal residue number according to the protein
Zone Subset CTERMx newmodel:344:O Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITEx residues according to the protein
Zone Subset ACTSITEx newmodel:HEM,56,184 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONEx Union ASPx GLUx
Combine Subset ALLZONEx Union ALLZONEx CTERMx
Combine Subset ALLZONEx Union ALLZONEx ACTSITEx
Combine Subset RESTx Difference newmodel ALLZONEx
List Subset RESTx Atom Output_File restxatom.list
List Subset RESTx monomer/residue Output_File restxmole.list
#
Color Molecule Atoms ACTSITEx Specified Specification 255,0,0
List Subset ACTSITEx Atom Output_File actsitexatom.list
List Subset ACTSITEx monomer/residue Output_File actsitexmole.list
#
#read restxatom.list or restxmole.list to identify sites for (not_gluasp)->gluasp ...
#subst. if needed

Comments:

The subset RESTx contains only four residues: S9, S334, G335
and P336, all of which are >5% solvent exposed. The mutations
S9D, S9E, S334D, S334E, G335D, G335E, P336D and P336E are
proposed in A. ramosus peroxidase. Acidic residues within the
subset ACTSITE are: E44, D57, D77, E87, E176, D179, E190, D202,
D209, D246 and the C-terminal carboxylic acid on P344. Of these,
only E44, D77, E176, D179, E190, D209, D246 and the C-terminal
carboxylic acid on P344 are solvent exposed. Suitable sites for
mutations are E44Q, D77N, E176Q, D179N, E190Q, D209N and D246N.
D246N and D246E are risky mutations due to D246's importance
for binding of heme.

The N-terminal 8 residues were not included in the
calculations above, as they do not appear in the structure.
None of these 8 residues, QGPGGGGG, contains carboxylic groups.
The following variants are proposed as possible mutations to
enable attachment to this region: Q1E, Q1D, G2E, G2D, P3E, P3D,
G4E, G4D, G5E, G5D, G6E, G6D, G7E, G7D, G8E, G8D.
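The variant names proposed for the N-terminal region follow a simple pattern (original residue, position, new residue) and can be enumerated mechanically. The sketch below is illustrative only; the function name `acidic_variants` is hypothetical, and the input is the eight-residue sequence QGPGGGGG stated at the start of this example.

```python
def acidic_variants(sequence, start=1):
    """List Glu (E) and Asp (D) substitutions for every residue that is
    not already acidic, using the one-letter mutation notation."""
    variants = []
    for offset, residue in enumerate(sequence):
        if residue in "DE":
            continue  # already carries a carboxylic group
        position = start + offset
        variants.append(f"{residue}{position}E")
        variants.append(f"{residue}{position}D")
    return variants

n_terminal_variants = acidic_variants("QGPGGGGG")
print(", ".join(n_terminal_variants))
```

The result reproduces the sixteen variants listed above, from Q1E through G8D.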
Relevant data for Example 4: 
Solvent accessibility data for A. ramosus peroxidase (Note:
as the first eight residues are missing in the X-ray structure,
the residue numbers printed in the accessibility list below are
8 lower than those used elsewhere for residue numbering.)
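The renumbering described in this note can be sketched as follows. This is an assumption-laden illustration: the function name `renumber` is hypothetical, and each entry is assumed to be written as NAME_NUMBER followed by the area value, with lines starting with `#` treated as comments.

```python
def renumber(lines, offset=8):
    """Parse accessibility entries and shift list numbers to sequence numbers."""
    entries = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and header comments
        label, area = line.split()
        name, number = label.split("_")
        entries.append((name, int(number) + offset, float(area)))
    return entries

# Two entries in the assumed format; values as printed in the accessibility list.
sample_lines = ["# residue area", "SER_1 143.698257", "GLN_26 25.589321"]
renumbered = renumber(sample_lines)
print(renumbered)
```

With the offset of 8, list entries SER 1 and GLN 26 correspond to S9 and Q34 in the numbering used elsewhere in this example.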



5 


# ARP 


Thu Jan 30 15:39:05 MET 1997 




# residue 


area 




SER 1 


143.698257 




VAL 2 


54.879990 




THR 3 


86.932701 


10 


CYS 4 


8.303715 




PRO 5 


126.854782 




GLY 6 


53.771488 




GLY 7 


48.137802 




GLN_8 


62.288475 


15 


SER 9 


79.932549 




THR 10 


16.299215 




SER 11 


81.928642 




ASN 12 


51.432678 




SER 13 


81.993019 


20 


GLN 14 


92.344009 




CYS 15 


0.000000 




CYS 16 


32.317432 




VAL 17 


54.067810 




TRP 18 


6.451035 


25 


PHE 19 


25.852070 




ASP 20 


79.033997 




VAL 21 


0.268693 




LEU 22 


22.032858 




ASP 23 


90.111404 


30 


ASP 24 


43.993240 




LEU 25 


1.074774 




GLN 26 


25.589321 




THR 27 


82.698059 




ASN 28 


96.600883 


35 


PHE 29 


32 .375275 




TYR 30 


5.898365 




GLN 31 


103.380585 




GLY 32 


40.042034 




SER 33 


46.789322 


40 


LYS 34 


87.161873 




CYS 35 


12.827215 




GLU 36 


51.582657 




SER 37 


16.378180 




PRO 38 


33.560043 


45 


VAL 39 


6.448641 




ARG 40 


7.068311 




LYS 41 


15.291286 




ILE 42 


1.612160 




LEU 43 


1.880854 


50 


ARG 44 


16.906845 




ILE 4 5 


0.000000 




VAL 46 


2.312647 




PHE 47 


2.955627 




HIS 48 


20.392527 


55 


ASP 49 


4.238116 
ALA_50 0.510757
ILE_51 1.576962
GLY_52 2.858601
PHE_53 48.633503
SER_54 8.973248
PRO_55 58.822315
ALA_56 59.782852
LEU_57 46.483955
THR_58 86.744827
ALA_59 89.515816
ALA_60 81.163239
GLY_61 70.119019
GLN_62 112.635498
PHE_63 93.522354
GLY_64 2.742587
GLY_65 13.379636
GLY_66 22.722847
GLY_67 0.000000
ALA_68 0.268693
ASP_69 12.074840
GLY_70 0.700486
SER_71 0.000000
ILE_72 0.000000
ILE_73 0.000000
ALA_74 17.304443
HIS_75 41.071186
SER_76 20.000793
ASN_77 120.855316
ILE_78 66.574982
GLU_79 2.334954
LEU_80 41.329689
ALA_81 77.370575
PHE_82 38.758774
PRO_83 131.946289
ALA_84 34.893864
ASN_85 5.457000
GLY_86 43.364151
GLY_87 51.561348
LEU_88 0.242063
THR_89 73.343575
ASP_90 130.139389
THR_91 17.863211
ILE_92 0.268693
GLU_93 92.210396
ALA_94 35.445068
LEU_95 1.343467
ARG_96 31.175611
ALA_97 44.650192
VAL_98 17.698566
GLY_99 1.471369
ILE_100 62.441463
ASN_101 107.139748
HIS_102 46.952496
GLY_103 46.559296
VAL_104 11.342628
SER_105 15.225677
PHE_106 6.422011
GLY_107 3.426864
ASP_108 10.740790
LEU_109 0.268693
ILE_110 1.880854
GLN_111 31.867456
PHE_112 0.000000
ALA_113 0.000000
THR_114 3.656114
ALA_115 8.299393
VAL_116 0.268693
GLY_117 0.268693
MET_118 3.761708
SER_119 14.536770
ASN_120 25.928799
CYS_121 0.537387
PRO_122 29.798336
GLY_123 33.080013
SER_124 17.115562
PRO_125 36.908714
ARG_126 108.274727
LEU_127 21.238588
GLU_128 53.742313
PHE_129 3.761708
LEU_130 12.928699
THR_131 10.414591
GLY_132 47.266495
ARG_133 12.247048
SER_134 63.047237
ASN_135 31.403708
SER_136 97.999619
SER_137 28.505201
GLN_138 102.845520
PRO_139 49.691917
SER_140 9.423104
PRO_141 25.724171
PRO_142 80.706665
SER_143 105.318176
LEU_144 20.154398
ILE_145 41.288322
PRO_146 10.462679
GLY_147 19.803421
PRO_148 18.130360
GLY_149 47.391853
ASN_150 60.248917
THR_151 87.887985
VAL_152 13.870322
THR_153 74.664734
ALA_154 45.251106
ILE_155 2.686934
LEU_156 28.720940
ASP_157 110.081253
ARG_158 31.228874
MET_159 1.612160
GLY_160 38.223858
ASP_161 46.293152
ALA_162 9.877204
GLY_163 34.267326
PHE_164 11.057570
SER_165 51.158882
PRO_166 62.767738
ASP_167 75.164917
GLU_168 43.334976
VAL_169 6.365355
VAL_170 2.955627
ASP_171 7.004863
LEU_172 1.880854
LEU_173 3.197691
ALA_174 0.000000
ALA_175 1.074774
HIS_176 0.502189
SER_177 0.806080
LEU_178 3.197691
ALA_179 3.337480
SER_180 0.466991
GLN_181 2.122917
GLU_182 40.996552
GLY_183 62.098671
LEU_184 23.954853
ASN_185 15.918136
SER_186 95.185318
ALA_187 59.075272
ILE_188 27.675419
PHE_189 102.799423
ARG_190 55.265549
SER_191 6.986028
PRO_192 2.686934
LEU_193 12.321225
ASP_194 2.127163
SER_195 33.556419
THR_196 33.049286
PRO_197 20.874798
GLN_198 65.729698
VAL_199 31.705818
PHE_200 4.753195
ASP_201 13.744506
THR_202 1.612160
GLN_203 16.081930
PHE_204 2.581340
TYR_205 1.880854
ILE_206 9.356181
GLU_207 0.735684
THR_208 10.685907
LEU_209 9.672962
LEU_210 2.955627
LYS_211 77.176834
GLY_212 40.968609
THR_213 78.718216
THR_214 21.738384
GLN_215 77.622299
PRO_216 25.441587
GLY_217 8.320850
PRO_218 96.972305
SER_219 64.627823
LEU_220 85.732414
GLY_221 27.361111
PHE_222 134.620178
ALA_223 3.873014
GLY_224 12.141763
GLU_225 65.129868
LEU_226 76.105843
SER_227 0.268693
PRO_228 7.017754
PHE_229 0.000000
PRO_230 47.827423
???_231 23.790522
???_232 6.643466
PHE_233 6.713862
???_234 18.012030
MET_235 4.598188
ARG_236 91.415581
SER_237 1.982125
ASP_238 6.246871
ALA_239 12.897283
LEU_240 76.820526
LEU_241 3.224321
ALA_242 1.400973
ARG_243 77.207176
ASP_244 36.207306
SER_245 104.023796
ARG_246 121.852341
THR_247 2.955627
ALA_248 4.810700
CYS_249 47.331306
ARG_250 62.062778
TRP_251 2.418241
GLN_252 5.554953
SER_253 38.284832
MET_254 1.124224
THR_255 0.000000
SER_256 53.758987
SER_257 37.276134
ASN_258 44.381340
GLU_259 149.565140
VAL_260 57.500389
MET_261 2.679314
GLY_262 10.175152
GLN_263 107.458916
ARG_264 36.402130
TYR_265 0.233495
ARG_266 91.179619
ALA_267 53.708500
ALA_268 6.504294
MET_269 17.122011
ALA_270 22.455158
LYS_271 73.386177
MET_272 3.959508
SER_273 15.043281
VAL_274 23.887930
LEU_275 17.196379
GLY_276 44.362202
PHE_277 68.062485
ASP_278 94.902039
ARG_279 113.549011
ASN_280 134.886017
ALA_281 72.340973
LEU_282 26.692348
THR_283 27.696728
ASP_284 72.214157
CYS_285 0.000000
SER_286 28.209335
ASP_287 64.560753
VAL_288 7.040061
ILE_289 8.665112
PRO_290 48.682365
SER_291 86.141670
ALA_292 29.031240
VAL_293 84.432014
SER_294 85.944153
ASN_295 49.017288
ASN_296 133.459198
ALA_297 57.283794
ALA_298 65.233749
PRO_299 24.751518
VAL_300 45.409184
ILE_301 8.060802
PRO_302 14.742939
GLY_303 16.589832
GLY_304 34.238071
LEU_305 24.719791
THR_306 49.356300
VAL_307 71.491821
ASP_308 130.906174
ASP_309 31.733070
ILE_310 19.581894
GLU_311 81.414574
VAL_312 94.769890
SER_313 39.688896
CYS_314 9.998511
PRO_315 120.328018
SER_316 95.364319
GLU_317 65.560959
PRO_318 100.254364
PHE_319 46.284115
PRO_320 31.328060
GLU_321 177.602249
ILE_322 33.449741
ALA_323 46.892982
THR_324 79.976471
ALA_325 36.423820
SER_326 124.467422
GLY_327 28.219524
PRO_328 107.553696
LEU_329 86.789825
PRO_330 34.287163
SER_331 75.764053
LEU_332 32.840569
ALA_333 61.516434
PRO_334 82.389992
ALA_335 6.246871
PRO_336 56.750813
HEM_337 60.435017
CA_338 2.078997
CA_339 0.000000
NAG_340 141.534668
NAG_341 186.311371
Subset REST:

restmole.list
Subset REST:
ARP: 9,69-70,125,127,133,299-301,334-336

restatom.list
Subset REST:
ARP:SER 9:N,CA,C,O,CB,OG
ARP:GLY 69:N,CA,C,O
ARP:GLN 70:N,CA,C,O,CB,CG,CD,OE1,NE2
ARP:GLY 125:N,CA,C,O
ARP:SER 127:N,CA,C,O,CB,OG
ARP:PRO 133:N,CA,CD,C,O,CB,CG
ARP:SER 299:N,CA,C,O,CB,OG
ARP:ALA 300:N,CA,C,O,CB
ARP:VAL 301:N,CA,C,O,CB,CG1,CG2
ARP:SER 334:N,CA,C,O,CB,OG
ARP:GLY 335:N,CA,C,O
ARP:PRO 336:N,CA,CD,C,O,CB,CG
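The residue lists in the `*mole.list` files use a compact comma-and-range notation. As a small illustrative sketch (the function name is hypothetical and not part of the Insight software), such a specification can be expanded into explicit residue numbers as follows. Tokens carrying a suffix, such as the `347H` seen in the ACTSITE list, are deliberately not handled here.

```python
def expand_residue_ranges(spec):
    """Expand '9,69-70,125' into [9, 69, 70, 125].

    Spaces and empty tokens (e.g. from a trailing comma) are ignored.
    Suffixed tokens such as '347H' are out of scope for this sketch.
    """
    residues = []
    for token in spec.replace(" ", "").split(","):
        if not token:
            continue
        if "-" in token:
            start, end = token.split("-")
            residues.extend(range(int(start), int(end) + 1))
        else:
            residues.append(int(token))
    return residues


rest = expand_residue_ranges("9,69-70,125,127,133,299-301,334-336")
print(rest)
print(len(rest))  # 12 residues, matching the 12 rows of restatom.list
```

The expanded list has one entry per residue row in restatom.list, which gives a quick consistency check between the two files.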

Subset SUB5B:

sub5bmole.list
Subset SUB5B:
ARP: 10-11,34,38,65-68,71-72,120-121,123-124,128-132,134,270,274,
ARP: 297-298,302-303,311-312,332-333,337-338

sub5batom.list
Subset SUB5B:
ARP:VAL 10:N,...
ARP:THR 11:N,...
ARP:GLN 34:N,...
ARP:TYR 38:N,...
ARP:LEU 65:N,...
ARP:THR 66:N,...
ARP:ALA 67:N,...
ARP:ALA 68:N,...
ARP:PHE 71:N,...
ARP:GLY 72:N,...
ARP:PHE 120:N,...
ARP:ALA 121:N,...
ARP:ALA 123:N,...
ARP:VAL 124:N,...
ARP:ASN 128:N,...
ARP:CYS 129:N,...
ARP:PRO 130:N,...
ARP:GLY 131:N,...
ARP:SER 132:N,...
ARP:ARG 134:N,...
ARP:GLY 270:N,...
ARP:ARG 274:N,...
ARP:ILE 297:N,...
ARP:PRO 298:N,...
ARP:SER 302:N,CA,C,O,CB,OG
ARP:ASN 303:N,CA,C,O,CB,CG,OD1,ND2
ARP:GLY 311:N,CA,C,O
ARP:GLY 312:N,CA,C,O
ARP:THR 332:N,CA,C,O,CB,OG1,CG2
ARP:ALA 333:N,CA,C,O,CB
ARP:LEU 337:N,CA,C,O,CB,CG,CD1,CD2
ARP:PRO 338:N,CA,CD,C,O,CB,CG
Subset ACTSITE:

actsitemole.list
Subset ACTSITE:
ARP: 44-61,75-77,79-80,87-88,90-96,99,118,122,126,135,148-149,152-158,
ARP: 163-164,167,176-194,197-205,207-209,211-213,216,230-231,241,
ARP: 243-246,249,259,273,277,280,343-347H

actsiteatom.list
Subset ACTSITE:
ARP:GLU 44:N,CA,C,O,CB,CG,CD,OE1,OE2
ARP:SER 45:N,CA,C,O,CB,OG
ARP:PRO 46:N,CA,CD,C,O,CB,CG
ARP:VAL 47:N,CA,C,O,CB,CG1,CG2
ARP:ARG 48:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
ARP:LYS 49:N,CA,C,O,CB,CG,CD,CE,NZ
ARP:ILE 50:N,CA,C,O,CB,CG1,CG2,CD1
ARP:LEU 51:N,CA,C,O,CB,CG,CD1,CD2
ARP:ARG 52:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
ARP:ILE 53:N,CA,C,O,CB,CG1,CG2,CD1
ARP:VAL 54:N,CA,C,O,CB,CG1,CG2
ARP:PHE 55:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:HIS 56:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
ARP:ASP 57:N,CA,C,O,CB,CG,OD1,OD2
ARP:ALA 58:N,CA,C,O,CB
ARP:ILE 59:N,CA,C,O,CB,CG1,CG2,CD1
ARP:GLY 60:N,CA,C,O
ARP:PHE 61:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:GLY 75:N,CA,C,O
ARP:ALA 76:N,CA,C,O,CB
ARP:ASP 77:N,CA,C,O,CB,CG,OD1,OD2
ARP:SER 79:N,CA,C,O,CB,OG
ARP:ILE 80:N,CA,C,O,CB,CG1,CG2,CD1
ARP:GLU 87:N,CA,C,O,CB,CG,CD,OE1,OE2
ARP:LEU 88:N,CA,C,O,CB,CG,CD1,CD2
ARP:PHE 90:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:PRO 91:N,CA,CD,C,O,CB,CG
ARP:ALA 92:N,CA,C,O,CB
ARP:ASN 93:N,CA,C,O,CB,CG,OD1,ND2
ARP:GLY 94:N,CA,C,O
ARP:GLY 95:N,CA,C,O
ARP:LEU 96:N,CA,C,O,CB,CG,CD1,CD2
ARP:THR 99:N,CA,C,O,CB,OG1,CG2
ARP:ILE 118:N,CA,C,O,CB,CG1,CG2,CD1
ARP:THR 122:N,CA,C,O,CB,OG1,CG2
ARP:MET 126:N,CA,C,O,CB,CG,SD,CE
ARP:LEU 135:N,CA,C,O,CB,CG,CD1,CD2
ARP:SER 148:N,CA,C,O,CB,OG
ARP:PRO 149:N,CA,CD,C,O,CB,CG
ARP:LEU 152:N,CA,C,O,CB,CG,CD1,CD2
ARP:ILE 153:N,CA,C,O,CB,CG1,CG2,CD1
ARP:PRO 154:N,CA,CD,C,O,CB,CG
ARP:GLY 155:N,CA,C,O
ARP:PRO 156:N,CA,CD,C,O,CB,CG
ARP:GLY 157:N,CA,C,O
ARP:ASN 158:N,CA,C,O,CB,CG,OD1,ND2
ARP:ILE 163:N,CA,C,O,CB,CG1,CG2,CD1
ARP:LEU 164:N,CA,C,O,CB,CG,CD1,CD2
ARP:MET 167:N,CA,C,O,CB,CG,SD,CE
ARP:GLU 176:N,CA,C,O,CB,CG,CD,OE1,OE2
ARP:VAL 177:N,CA,C,O,CB,CG1,CG2
ARP:VAL 178:N,CA,C,O,CB,CG1,CG2
ARP:ASP 179:N,CA,C,O,CB,CG,OD1,OD2
ARP:LEU 180:N,CA,C,O,CB,CG,CD1,CD2
ARP:LEU 181:N,CA,C,O,CB,CG,CD1,CD2
ARP:ALA 182:N,CA,C,O,CB
ARP:ALA 183:N,CA,C,O,CB
ARP:HIS 184:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
ARP:SER 185:N,CA,C,O,CB,OG
ARP:LEU 186:N,CA,C,O,CB,CG,CD1,CD2
ARP:ALA 187:N,CA,C,O,CB
ARP:SER 188:N,CA,C,O,CB,OG
ARP:GLN 189:N,CA,C,O,CB,CG,CD,OE1,NE2
ARP:GLU 190:N,CA,C,O,CB,CG,CD,OE1,OE2
ARP:GLY 191:N,CA,C,O
ARP:LEU 192:N,CA,C,O,CB,CG,CD1,CD2
ARP:ASN 193:N,CA,C,O,CB,CG,OD1,ND2
ARP:SER 194:N,CA,C,O,CB,OG
ARP:PHE 197:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:ARG 198:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
ARP:SER 199:N,CA,C,O,CB,OG
ARP:PRO 200:N,CA,CD,C,O,CB,CG
ARP:LEU 201:N,CA,C,O,CB,CG,CD1,CD2
ARP:ASP 202:N,CA,C,O,CB,CG,OD1,OD2
ARP:SER 203:N,CA,C,O,CB,OG
ARP:THR 204:N,CA,C,O,CB,OG1,CG2
ARP:PRO 205:N,CA,CD,C,O,CB,CG
ARP:VAL 207:N,CA,C,O,CB,CG1,CG2
ARP:PHE 208:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:ASP 209:N,CA,C,O,CB,CG,OD1,OD2
ARP:GLN 211:N,CA,C,O,CB,CG,CD,OE1,NE2
ARP:PHE 212:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:TYR 213:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
ARP:THR 216:N,CA,C,O,CB,OG1,CG2
ARP:PHE 230:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:ALA 231:N,CA,C,O,CB
ARP:PHE 241:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
ARP:MET 243:N,CA,C,O,CB,CG,SD,CE
ARP:ARG 244:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
ARP:SER 245:N,CA,C,O,CB,OG
ARP:ASP 246:N,CA,C,O,CB,CG,OD1,OD2
ARP:LEU 249:N,CA,C,O,CB,CG,CD1,CD2
ARP:TRP 259:N,CA,C,O,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2
ARP:TYR 273:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
ARP:MET 277:N,CA,C,O,CB,CG,SD,CE
ARP:MET 280:N,CA,C,O,CB,CG,SD,CE
ARP:ALA 343:N,CA,C,O,CB
ARP:PRO 344:N,CA,CD,C,O,OXT,CB,CG
ARP:HEM 345H:FE,NA,NB,NC,ND,CHA,CHB,CHC,CHD,C1A,C2A,C3A,C4A,CMA,CAA,CBA,CGA
ARP:HEM 345H:O1A,O2A,C1B,C2B,C3B,C4B,CMB,CAB,CBB,C1C,C2C,C3C,C4C,CMC,CAC,CBC
ARP:HEM 345H:C1D,C2D,C3D,C4D,CMD,CAD,CBD,CGD,O1D,O2D
ARP:CA 346H:CA
ARP:CA 347H:CA

Subset RESTx:

restxmole.list
Subset RESTX:
NEWMODEL: 9,334-336

restxatom.list
Subset RESTX:
NEWMODEL:SER 9:N,CA,C,O,CB,OG
NEWMODEL:SER 334:N,CA,C,O,CB,OG
NEWMODEL:GLY 335:N,CA,C,O
NEWMODEL:PRO 336:N,CA,CD,C,O,CB,CG



Example 5

Activation of mPEG 15,000 with N-succinimidyl carbonate

mPEG 15,000 was suspended in toluene (4 ml/g of mPEG); 20% was
distilled off at normal pressure to dry the reactants
azeotropically. Dichloromethane (dry, 1 ml/g mPEG) was added when
the solution had cooled to 30°C, phosgene in toluene (1.93 M, 5
mole/mole mPEG) was added, and the mixture was stirred at room
temperature overnight. The mixture was evaporated to dryness and
the desired product was obtained as waxy lumps.

After evaporation, dichloromethane and toluene (1:2, dry, 3
ml/g mPEG) were added to re-dissolve the white solid.
N-Hydroxysuccinimide (2 mole/mole mPEG) was added as a solid,
followed by triethylamine (1.1 mole/mole mPEG). The mixture was
stirred for 3 hours; it was initially unclear, then clear, and
ended with a small precipitate. The mixture was evaporated to
dryness and recrystallised from ethyl acetate (10 ml) with warm
filtration to remove salts and insoluble traces. The clear liquid
was left for slow cooling at ambient temperature for 16 hours and
then in the refrigerator overnight. The white precipitate was
filtered, washed with a little cold ethyl acetate and dried to
yield 98% (w/w). NMR indicated 80-90% activation and 5‰ (w/w)
HNEt3Cl. 1H-NMR for mPEG 15,000 (CDCl3): δ 1.42 t (I = 4.8, CH3 in
HNEt3Cl), 2.84 s (I = 3.7, succinimide), 3.10 dq (I = 3.4, CH2 in
HNEt3Cl), 3.38 s (I = 2.7, CH3 in OMe), 3.40* dd (I = 4.5‰, 13C
satellite), 3.64 bs (I = 1364, main peak), 3.89* dd (I = 4.8‰,
13C satellite), 4.47 dd (I = 1.8, CH2 in PEG). No change was seen
after storage in a desiccator at 22°C for 4 months.
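The molar ratios given in this example can be turned into quantities per gram of mPEG. The following is only an arithmetic sketch under the assumption that the nominal molecular weight (15,000 g/mol here, 5,000 g/mol for the shorter polymer) is used as the molar mass; the function and key names are hypothetical.

```python
PHOSGENE_MOLARITY = 1.93  # mol/l, phosgene in toluene, as stated in the text


def reagents_per_gram(mpeg_mw):
    """Reagent amounts per 1 g of mPEG for the ratios given in Example 5.

    mpeg_mw is the nominal molar mass in g/mol (an assumption: the real
    polymer is polydisperse).
    """
    mol_mpeg = 1.0 / mpeg_mw  # mol of mPEG in 1 g
    return {
        "toluene_ml": 4.0,                                   # 4 ml/g
        "dichloromethane_ml": 1.0,                           # 1 ml/g
        "phosgene_solution_ml": 5 * mol_mpeg / PHOSGENE_MOLARITY * 1000,
        "n_hydroxysuccinimide_mmol": 2 * mol_mpeg * 1000,    # 2 mole/mole
        "triethylamine_mmol": 1.1 * mol_mpeg * 1000,         # 1.1 mole/mole
    }


for mw in (15000, 5000):
    amounts = reagents_per_gram(mw)
    print(mw, {k: round(v, 4) for k, v in amounts.items()})
```

For mPEG 15,000 this gives roughly 0.17 ml of the 1.93 M phosgene solution per gram of polymer; for mPEG 5,000 the same ratios give about three times as much.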

Example 6

Activation of mPEG 5,000 with N-succinimidyl carbonate

Activation of mPEG 5,000 with N-succinimidyl carbonate was
performed as described in Example 5.

Example 7

Construction and expression of PD498 variants:

PD498 site-directed variants were constructed using the
"maxi-oligonucleotide-PCR" method described by Sarkar et al.
(1990), BioTechniques 8: 404-407.
The template plasmid was shuttle vector pPD498 or an analogue
of this containing a variant of the PD498 protease gene.
The following PD498 variants were constructed, expressed and
purified.

A: R28K
B: R62K
C: R169K
D: R28K + R62K
E: R28K + R169K
F: R62K + R169K
G: R28K + R62K + R169K

Construction of variants

For introduction of the R28K substitution a synthetic
oligonucleotide having the sequence:
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7) was used.
A PCR fragment of 769 bp was ligated into the pPD498 plasmid
prepared by BstEII and BglII digestion. Positive variants were
recognized by StyI digestion and verified by DNA sequencing of the
total 769 bp insert.

For introduction of the R62K substitution a synthetic
oligonucleotide having the sequence:
CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) was used.
A PCR fragment of 769 bp was ligated into the pPD498 plasmid
prepared by BstEII and BglII digestion. Positive variants were
recognized by ClaI digestion and verified by DNA sequencing of the
total 769 bp insert.

For introduction of the R169K substitution a synthetic
oligonucleotide having the sequence:
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) was used.
A PCR fragment of 769 bp was ligated into the pPD498 plasmid
prepared by BstEII and BglII digestion. Positive variants were
recognized by the absence of an RsaI restriction site and verified
by DNA sequencing of the total 769 bp insert.

For simultaneous introduction of the R28K and the R62K
substitutions, synthetic oligonucleotides having the sequence:
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7) and the
sequence:
CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) were used
simultaneously. A PCR fragment of 769 bp was ligated into the
pPD498 plasmid prepared by BstEII and BglII digestion. Positive
variants were recognized by StyI and ClaI digestion and verified
by DNA sequencing of the total 769 bp insert.

For simultaneous introduction of the R28K and the R169K
substitutions, synthetic oligonucleotides having the sequence:
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7) and the
sequence:
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used
simultaneously. A PCR fragment of 769 bp was ligated into the
pPD498 plasmid prepared by BstEII and BglII digestion. Positive
variants were recognized by StyI digestion and absence of an RsaI
site. The variant was verified by DNA sequencing of the total 769
bp insert.

For simultaneous introduction of the R62K and the R169K
substitutions, synthetic oligonucleotides having the sequence:
CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the sequence:
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used
simultaneously. A PCR fragment of 769 bp was ligated into the
pPD498 plasmid prepared by BstEII and BglII digestion. Positive
variants were recognized by ClaI digestion and absence of an RsaI
site. The variant was verified by DNA sequencing of the total 769
bp insert.

For simultaneous introduction of the R28K, the R62K and the
R169K substitutions, synthetic oligonucleotides having the
sequence:
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7), the
sequence:
CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the
sequence:
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used
simultaneously. A PCR fragment of 769 bp was ligated into the
pPD498 plasmid prepared by BstEII and BglII digestion. Positive
variants were recognized by StyI and ClaI digestion and absence of
an RsaI site. The variant was verified by DNA sequencing of the
total 769 bp insert.
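The screening logic above relies on each mutagenic oligonucleotide creating or removing a restriction site. As a hedged sketch, the three oligonucleotides can be checked against the standard recognition sequences of StyI (CCWWGG, W = A or T), ClaI (ATCGAT) and RsaI (GTAC). These recognition sequences are taken from general knowledge of the enzymes rather than from this text, and the helper name is hypothetical. Only the oligonucleotide itself is scanned, not the full 769 bp fragment.

```python
import re

# Recognition sequences (general knowledge, not stated in the text).
# All three are palindromic, so scanning one strand is sufficient.
SITES = {
    "StyI": "CC[AT][AT]GG",
    "ClaI": "ATCGAT",
    "RsaI": "GTAC",
}

# Mutagenic oligonucleotides as given in the text (SEQ ID NO. 7, 8 and 9).
PRIMERS = {
    "R28K": "GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG",
    "R62K": "CGA CTT TAT CGA TAA GGA CAA TAA CCC",
    "R169K": "CAA TGT ATC CAA AAC GTT CCA ACC AGC",
}


def sites_in(primer):
    """Return the sorted names of the enzymes whose site occurs in the primer."""
    seq = primer.replace(" ", "").upper()
    return sorted(name for name, pattern in SITES.items()
                  if re.search(pattern, seq))


for name, primer in PRIMERS.items():
    print(name, sites_in(primer))
```

The R28K oligonucleotide contains a StyI site and the R62K oligonucleotide a ClaI site, while the R169K oligonucleotide contains no RsaI site, which is consistent with the screening criteria described for each variant.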

Fermentation, expression and purification of PD498 variants

Vectors hosting the above-mentioned PD498 variants were
purified from E. coli cultures and transformed into B. subtilis,
in which organism the variants were fermented, expressed and
purified as described in the "Materials and Methods" section above.

Example 7

Conjugation of triple substituted PD498 variant with activated
mPEG 5,000

200 mg of triple substituted PD498 variant (i.e. the
R28K+R62K+R169K substituted variant) was incubated in 50 mM
sodium borate, pH 10, with 1.8 g of mPEG 5,000 activated with
N-succinimidyl carbonate (prepared according to Example 2), in a
final volume of 20 ml. The reaction was carried out at ambient
temperature using magnetic stirring. Reaction time was 1 hour. The
reaction was stopped by adding DMG buffer to a final concentration
of 5 mM dimethyl glutarate, 1 mM CaCl2 and 50 mM borate, pH 5.0.

The molecular weight of the obtained derivative was
approximately 120 kDa, corresponding to about 16 moles of mPEG
attached per mole enzyme.

Compared to the parent enzyme, residual activity was close to
100% towards peptide substrate
(succinyl-Ala-Ala-Pro-Phe-p-nitroanilide).
Example 8

Allergenicity trials of PD498 variant-SPEG 5,000 in guinea pigs

Dunkin Hartley guinea pigs are stimulated with 1.0 µg PD498-
SPEG 5,000 and 1.0 µg modified variant PD498-SPEG 5,000 by
intratracheal instillation.

Sera from immunized Dunkin Hartley guinea pigs are tested
during the trial period in a specific IgG1 ELISA (described above)
to elucidate whether the molecules could activate the immune
response system, giving rise to a specific IgG1 response indicating
an allergenic response.

The IgG1 levels of Dunkin Hartley guinea pigs during the trial
period of 10 weeks are observed.

Example 9

Suitable substitutions in Humicola lanuginosa lipase for
addition of amino attachment groups (-NH2)

The 3D structure of Humicola lanuginosa lipase (SEQ ID NO 6)
is available in Brookhaven Databank as 1tib.pdb. The lipase
consists of 269 amino acids.
The procedure described in Example 1 was followed. The
sequence of H. lanuginosa lipase is shown below in the table
listing solvent accessibility data for H. lanuginosa lipase.
H. lanuginosa residue numbering is used (1-269), and the active
site residues (functional site) are S146, D201 and H258. The
synonym TIB is used for H. lanuginosa lipase.
The commands performed in Insight (BIOSYM) are shown in the
command files makeKzone.bcl and makeKzone2.bcl below:



Conservative substitutions:

makeKzone.bcl
Delete Subset *
Color Molecule Atoms * Specified Specification 255,0,255
Zone Subset LYS :lys:NZ Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset NTERM :1:N Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITE residues according to the protein
Zone Subset ACTSITE :146,201,258 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONE Union LYS NTERM
Combine Subset ALLZONE Union ALLZONE ACTSITE
#NOTE: editnextline object name according to the protein
Combine Subset REST Difference TIB ALLZONE
List Subset REST Atom Output_File restatom.list
List Subset REST monomer/residue Output_File restmole.list
Color Molecule Atoms ACTSITE Specified Specification 255,0,0
List Subset ACTSITE Atom Output_File actsiteatom.list
List Subset ACTSITE monomer/residue Output_File actsitemole.list
#
Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset
Combine Subset SUB5A Difference REST5A ACTSITE
Combine Subset SUB5B Difference SUB5A REST
Color Molecule Atoms SUB5B Specified Specification 255,255,255
List Subset SUB5B Atom Output_File sub5batom.list
List Subset SUB5B monomer/residue Output_File sub5bmole.list
#Now identify sites for lys->arg substitutions and continue with makezone2.bcl
#Use grep command to identify ARG in restatom.list, sub5batom.list & actsiteatom.list
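The subset logic of the command file is a sequence of distance-based zones followed by set unions and differences. The following Python sketch mirrors that logic on toy coordinates. It is not the Insight implementation: the `zone` and `partition` helpers, the one-atom-per-residue toy data, and the reading of "Zone ... monomer/residue R" as "every residue with any atom within R Å of a seed atom" are all assumptions made for illustration.

```python
from math import dist  # Python 3.8+


def zone(coords, seed_points, radius):
    """Residues having any atom within `radius` of any seed point."""
    return {res for res, atoms in coords.items()
            if any(dist(a, s) <= radius for a in atoms for s in seed_points)}


def partition(coords, lys_nz, nterm_n, actsite_residues):
    """Return (REST, SUB5B, ACTSITE) following the order of the command file."""
    actsite_points = [p for r in actsite_residues for p in coords[r]]
    lys = zone(coords, lys_nz, 10.0)             # Zone Subset LYS ... 10
    nterm = zone(coords, nterm_n, 10.0)          # Zone Subset NTERM ... 10
    actsite = zone(coords, actsite_points, 8.0)  # Zone Subset ACTSITE ... 8
    allzone = lys | nterm | actsite              # Combine ... Union
    rest = set(coords) - allzone                 # REST = object - ALLZONE
    rest_points = [p for r in rest for p in coords[r]]
    rest5a = zone(coords, rest_points, 5.0)      # Zone Subset REST5A REST ... 5
    sub5a = rest5a - actsite                     # SUB5A = REST5A - ACTSITE
    sub5b = sub5a - rest                         # SUB5B = SUB5A - REST
    return rest, sub5b, actsite


# Toy model: five residues on a line, one atom each (hypothetical data).
coords = {1: [(0, 0, 0)], 2: [(9, 0, 0)], 3: [(13, 0, 0)],
          4: [(17, 0, 0)], 5: [(30, 0, 0)]}
rest, sub5b, actsite = partition(coords, lys_nz=[(0, 0, 0)],
                                 nterm_n=[], actsite_residues=[5])
print(rest, sub5b, actsite)
```

In the toy model residue 2 lies inside the 10 Å lysine zone but within 5 Å of REST, so it ends up in SUB5B; this is the same border region in which Arg118 and Arg125 are found for TIB.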

Comments:

In this case of H. lanuginosa (=TIB), REST contains the
arginines Arg133, Arg139, Arg160, Arg179 and Arg209, and SUB5B
contains Arg118 and Arg125.
These residues are all solvent exposed. The substitutions
R133K, R139K, R160K, R179K, R209K, R118K and R125K are
identified in TIB as sites for mutagenesis within the scope of
this invention. The residues are substituted below in section
2, and further analysis done. The subset ACTSITE contains no
lysines.



Non-conservative substitutions:

makeKzone2.bcl
#sourcefile makezone2.bcl Claus von der Osten 961128
#
#having scanned lists (grep arg command) and identified sites for lys->arg substitutions
#NOTE: editnextline object name according to protein
Copy Object -To_Clipboard -Displace TIB newmodel
Biopolymer
#NOTE: editnextline object name according to protein
Blank Object On TIB
#NOTE: editnextlines with lys->arg positions
Replace Residue newmodel:118 lys L
Replace Residue newmodel:125 lys L
Replace Residue newmodel:133 lys L
Replace Residue newmodel:139 lys L
Replace Residue newmodel:160 lys L
Replace Residue newmodel:179 lys L
Replace Residue newmodel:209 lys L
#
#Now repeat analysis done prior to arg->lys, now including introduced lysines
Color Molecule Atoms newmodel Specified Specification 255,0,255
Zone Subset LYSx newmodel:lys:NZ Static monomer/residue 10 Color_Subset 255,255,0
Zone Subset NTERMx newmodel:1:N Static monomer/residue 10 Color_Subset 255,255,0
#NOTE: editnextline ACTSITEx residues according to the protein
Zone Subset ACTSITEx newmodel:146,201,258 Static monomer/residue 8 Color_Subset 255,255,0
Combine Subset ALLZONEx Union LYSx NTERMx
Combine Subset ALLZONEx Union ALLZONEx ACTSITEx
Combine Subset RESTx Difference newmodel ALLZONEx
List Subset RESTx Atom Output_File restxatom.list
List Subset RESTx monomer/residue Output_File restxmole.list
#
Color Molecule Atoms ACTSITEx Specified Specification 255,0,0
List Subset ACTSITEx Atom Output_File actsitexatom.list
List Subset ACTSITEx monomer/residue Output_File actsitexmole.list
#
#read restxatom.list or restxmole.list to identify sites for (not_arg)->lys subst. if needed

Comments:

Of the residues in RESTx, the following are >5% exposed (see
lists below): 18,31-33,36,38,40,48,50,56-62,64,78,88,91-93,104-
106,120,136,225,227-229,250,262,268. Of these, three are
cysteines involved in disulfide bridge formation, and are
consequently excluded, for structural reasons, from the residues
to be mutated. The following mutations are proposed in H.
lanuginosa lipase (TIB):
A18K,G31K,T32K,N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K,
V60K,G61K,D62K,T64K,L78K,N88K,G91K,N92K,L93K,S105K,G106K,
V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K.
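The step from a list of exposed RESTx positions to the proposed variant names can be sketched as below. The residue identities are those shown in the accessibility list for TIB; the helper name and the rule "skip cysteines and existing lysines" are a simplified, assumed restatement of the selection described above, not a definitive procedure.

```python
THREE_TO_ONE = {
    "ALA": "A", "ARG": "R", "ASN": "N", "ASP": "D", "CYS": "C",
    "GLN": "Q", "GLU": "E", "GLY": "G", "HIS": "H", "ILE": "I",
    "LEU": "L", "LYS": "K", "MET": "M", "PHE": "F", "PRO": "P",
    "SER": "S", "THR": "T", "TRP": "W", "TYR": "Y", "VAL": "V",
}


def lysine_variants(residues):
    """Name X->K variants for (name, position) pairs.

    Cysteines are skipped (disulfide bridges) and lysines need no change.
    """
    return [f"{THREE_TO_ONE[name]}{pos}K"
            for name, pos in residues
            if name not in ("CYS", "LYS")]


# First exposed RESTx positions from the list above: 18, 31-33, 36, 38, 40.
exposed = [("ALA", 18), ("GLY", 31), ("THR", 32), ("ASN", 33),
           ("CYS", 36), ("GLY", 38), ("ALA", 40)]
print(",".join(lysine_variants(exposed)))  # A18K,G31K,T32K,N33K,G38K,A40K
```

Position 36 is dropped because it is a cysteine, which reproduces the start of the proposed mutation list.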
Relevant data for Example 2:

# TIBNOH20
# residue area
GLU_1 110.792610
VAL_2 18.002457
SER_3 53.019516
GLN_4 85.770164
ASP_5 107.565826
LEU_6 33.022659
PHE_7 34.392754
ASN_8 84.855331
GLN_9 39.175591
PHE_10 2.149547
ASN_11 40.544380
LEU_12 27.648788
PHE_13 2.418241
ALA_14 4.625293
GLN_15 28.202387
TYR_16 0.969180
SER_17 0.000000
ALA_18 7.008336
ALA_19 0.000000
ALA_20 0.000000
TYR_21 6.947358
CYS_22 8.060802
GLY_23 32.147034
LYS_24 168.890747
ASN_25 8.014721
ASN_26 11.815564
ASP_27 92.263428
ALA_28 18.206699
PRO_29 83.188431
ALA_30 69.428421
GLY_31 50.693439
THR_32 52.171135
ASN_33 111.230743
ILE_34 2.801945
THR_35 82.130569
CYS_36 17.269245
THR_37 96.731941
GLY_38 77.870995
ASN_39 123.051003
ALA_40 27.985256
CYS_41 0.752820
PRO_42 46.258949
GLU_43 69.773987
VAL_44 0.735684
GLU_45 77.169510
LYS_46 141.213562
ALA_47 10.249716
ASP_48 109.913902
ALA_49 2.602721
THR_50 32.012184
PHE_51 8.255627
LEU_52 60.093613
TYR_53 77.877937
SER_54 26.980494
PHE_55 10.747735
GLU_56 112.689758
ASP_57 92.064278
SER_58 32.990780
GLY_59 53.371807
VAL_60 83.563644
GLY_61 69.625633
ASP_62 75.520988
VAL_63 4.030401
THR_64 8.652839
GLY_65 0.000000
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PHE 


66 


U . 




LEU 


"67 


11 Q O 7 C 1 A 




ALA" 


'68 


n RT71Q7 
U« jJ / Jo / 




LEU" 


'69 


JU«£4JO/U 


c 
D 


asp" 


"70 


u • uuuuuu 




asn" 


"71 


OA i m fiA A 
o4 • XUXU44 




thr~ 


"72 


QQ ^71 1 Ofi 




asn" 


"73 


/U . /4i4UX 




LYS" 


"74 


aa 1 1 0. 1 £ Q 


10 


LEU" 


"75 


o • J2949;> 




ILE" 


"76 


5 • 19 /o /o 




VAL 77 


U . SUbUoU 




LEU 78 


O . 29 /O 




SER 


79 


U • UUUUUU 


15 


PHE" 


"80 


2 • 079151 




arg" 


"81 


41 • 085312 




gly" 


"82 


1 jITI ICO 

1 . 4713o9 




ser" 


"83 


4 J • / 940X4 




arg" 


"84 


XUU . 2olo2 / 


20 


ser" 


"85 


/U . OU /D02 




ile" 


"86 


CO C O £ O C C 




glu" 


"87 






asn" 


"88 


lift T7IIT70. 

XX9 . j/oj/j 




TRP 


"89 




ZD 


ILE" 


"90 


7ft nCDt\DQ 
/ O • UDODOO 




gly" 


'91 


OU • / OJ OU / 




asn" 


'92 






leu" 


'93 


XJ4 • O J O J 




asn" 


"94 




3 n 


PHE" 


"95 






asp" 


"96 


7Q A45Q50 




leu" 


"97 


75 7R1 572 




LYS" 


"98 


QQ OAOOfil 
O O • OHU&Q J 




GLU" 


"99 


iio 377OQ0 


35 


ILE 100 


7 • X J *J *J ' *J 




ASN_101 63.444527
ASP_102 88.652847
ILE_103 33.470661
CYS_104 11.553816
SER_105 ??.461174
GLY_106 40.325161
CYS_107 4.433561
ARG_108 ?7.450104
GLY_109 1.343467
HIS_110 4.652464
ASP_111 37.023655
GLY_112 29.930408
PHE_113 14.976435
THR_114 10.430954
SER_115 40.606895
SER_116 13.462922
TRP_117 10.747735
ARG_118 114.364281
SER_119 46.880249
VAL_120 13.434669
ALA_121 18.258261
ASP_122 110.753098
THR_123 69.641922
LEU_124 17.090784
ARG_125 73.929977
GLN_126 101.320190
LYS_127 84.450241
VAL_128 6.448641
GLU_129 47.700993
ASP_130 75.529091
ALA_131 11.340775
VAL_132 27.896025
ARG_133 153.136490
GLU_134 132.140594
HIS_135 54.553406
PRO_136 97.386963
ASP_137 22.653191
TYR_138 35.392658
ARG_139 74.321243
VAL_140 10.173222
VAL_141 0.233495
PHE_142 3.224321
THR_143 0.000000
GLY_144 0.000000
HIS_145 4.514527
SER_146 15.749787
LEU_147 40.709171
GLY_148 0.000000
GLY_149 0.000000
ALA_150 0.537387
LEU_151 22.838938
ALA_152 0.268693
THR_153 18.078798
VAL_154 7.254722
ALA_155 0.000000
GLY_156 0.000000
ALA_157 15.140230
ASP_158 41.645477
LEU_159 6.144750
ARG_160 41.939716
GLY_161 68.978180
ASN_162 68.243805
GLY_163 79.181274
TYR_164 36.190247
ASP_165 103.068283
ILE_166 0.000000
ASP_167 24.326443
VAL_168 4.299094
PHE_169 0.466991
SER_170 3.339332
TYR_171 0.000000
GLY_172 0.000000
ALA_173 12.674671
PRO_174 13.117888
ARG_175 10.004488
VAL_176 21.422220
GLY_177 2.680759
ASN_178 21.018063
ARG_179 110.282166
ALA_180 33.210381
PHE_181 4.567788
ALA_182 3.897251
GLU_183 76.354004
PHE_184 71.225983
LEU_185 24.985012
THR_186 47.023815
VAL_187 98.244606
GLN_188 54.152954
THR_189 88.660645
GLY_190 24.792120
GLY_191 10.726818
THR_192 45.458744
LEU_193 16.633211
TYR_194 34.829491
ARG_195 29.030851
ILE_196 1.973557
THR_197 3.493014
HIS_198 1.532270
THR_199 34.785877
ASN_200 39.789238
ASP_201 0.000000
ILE_202 31.168434
VAL_203 29.521076
PRO_204 3.515322
ARG_205 44.882454
LEU_206 51.051746
PRO_207 12.575329
PRO_208 43.259636
ARG_209 113.700233
GLU_210 154.628540
PHE_211 112.505188
GLY_212 30.084938
TYR_213 3.268936
SER_214 12.471436
HIS_215 23.354481
SER_216 16.406200
SER_217 14.665598
PRO_218 17.240993
GLU_219 13.145291
TYR_220 18.718306
TRP_221 39.229233
ILE_222 5.105175
LYS_223 120.739983
SER_224 15.407301
GLY_225 29.306646
THR_226 66.806862
LEU_227 122.682808
VAL_228 60.923004
PRO_229 104.620377
VAL_230 23.398251
THR_231 63.372971
ARG_232 80.357857
ASN_233 89.255066
ASP_234 43.011250
ILE_235 2.114349
VAL_236 45.140491
LYS_237 105.651306
ILE_238 24.671705
GLU_239 116.891907
GLY_240 31.965794
ILE_241 46.278099
ASP_242 28.963699
ALA_243 25.158146
THR_244 98.351440
GLY_245 43.842186
GLY_246 0.700486
ASN_247 3.926274
ASN_248 51.047890
GLN_249 66.699188
PRO_250 132.414047
ASN_251 70.213730
ILE_252 141.498062
PRO_253 59.089233
ASP_254 59.010895
ILE_255 63.298943
PRO_256 78.608688
ALA_257 0.806080
HIS_258 3.761708
LEU_259 50.747856
TRP_260 35.229710
TYR_261 5.440791
PHE_262 36.457939
GLY_263 22.071375
LEU_264 109.148178
ILE_265 2.418241
GLY_266 17.730062
THR_267 68.217873
CYS_268 15.418195
LEU_269 165.990997




Subset REST:
restmole.list
Subset REST:
TIB:5,8-9,13-14,16,18-20,31-34,36,38,40,48-50,56-66,68,76-79,88,91-93,
TIB:100-107,116-117,119-121,132-134,136,139-142,154-169,177-185,
TIB:187,189-191,207-212,214-216,225,227-229,241-244,250,262,268
restatom.list
Subset REST:
TIB:ASP 5:N,CA,C,O,CB,CG,OD1,OD2
TIB:ASN 8:N,CA,C,O,CB,CG,OD1,ND2
TIB:GLN 9:N,CA,C,O,CB,CG,CD,OE1,NE2
TIB:PHE 13:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:ALA 14:N,CA,C,O,CB
TIB:TYR 16:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
TIB:ALA 18:N,CA,C,O,CB
TIB:ALA 19:N,CA,C,O,CB
TIB:ALA 20:N,CA,C,O,CB
TIB:GLY 31:N,CA,C,O
TIB:THR 32:N,CA,C,O,CB,OG1,CG2
TIB:ASN 33:N,CA,C,O,CB,CG,OD1,ND2
TIB:ILE 34:N,CA,C,O,CB,CG1,CG2,CD1
TIB:CYS 36:N,CA,C,O,CB,SG
TIB:GLY 38:N,CA,C,O
TIB:ALA 40:N,CA,C,O,CB
TIB:ASP 48:N,CA,C,O,CB,CG,OD1,OD2
TIB:ALA 49:N,CA,C,O,CB
TIB:THR 50:N,CA,C,O,CB,OG1,CG2
TIB:GLU 56:N,CA,C,O,CB,CG,CD,OE1,OE2
TIB:ASP 57:N,CA,C,O,CB,CG,OD1,OD2
TIB:SER 58:N,CA,C,O,CB,OG
TIB:GLY 59:N,CA,C,O
TIB:VAL 60:N,CA,C,O,CB,CG1,CG2
TIB:GLY 61:N,CA,C,O
TIB:ASP 62:N,CA,C,O,CB,CG,OD1,OD2
TIB:VAL 63:N,CA,C,O,CB,CG1,CG2
TIB:THR 64:N,CA,C,O,CB,OG1,CG2
TIB:GLY 65:N,CA,C,O
TIB:PHE 66:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:ALA 68:N,CA,C,O,CB
TIB:ILE 76:N,CA,C,O,CB,CG1,CG2,CD1
TIB:VAL 77:N,CA,C,O,CB,CG1,CG2
TIB:LEU 78:N,CA,C,O,CB,CG,CD1,CD2
TIB:SER 79:N,CA,C,O,CB,OG
TIB:ASN 88:N,CA,C,O,CB,CG,OD1,ND2
TIB:GLY 91:N,CA,C,O
TIB:ASN 92:N,CA,C,O,CB,CG,OD1,ND2
TIB:LEU 93:N,CA,C,O,CB,CG,CD1,CD2
TIB:ILE 100:N,CA,C,O,CB,CG1,CG2,CD1
TIB:ASN 101:N,CA,C,O,CB,CG,OD1,ND2
TIB:ASP 102:N,CA,C,O,CB,CG,OD1,OD2
TIB:ILE 103:N,CA,C,O,CB,CG1,CG2,CD1
TIB:CYS 104:N,CA,C,O,CB,SG
TIB:SER 105:N,CA,C,O,CB,OG
TIB:GLY 106:N,CA,C,O
TIB:CYS 107:N,CA,C,O,CB,SG
TIB:SER 116:N,CA,C,O,CB,OG
TIB:TRP 117:N,CA,C,O,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2
TIB:SER 119:N,CA,C,O,CB,OG
TIB:VAL 120:N,CA,C,O,CB,CG1,CG2
TIB:ALA 121:N,CA,C,O,CB
TIB:VAL 132:N,CA,C,O,CB,CG1,CG2
TIB:ARG 133:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
TIB:GLU 134:N,CA,C,O,CB,CG,CD,OE1,OE2
TIB:PRO 136:N,CA,CD,C,O,CB,CG
TIB:ARG 139:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
TIB:VAL 140:N,CA,C,O,CB,CG1,CG2
TIB:VAL 141:N,CA,C,O,CB,CG1,CG2
TIB:PHE 142:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:VAL 154:N,CA,C,O,CB,CG1,CG2
TIB:ALA 155:N,CA,C,O,CB
TIB:GLY 156:N,CA,C,O
TIB:ALA 157:N,CA,C,O,CB
TIB:ASP 158:N,CA,C,O,CB,CG,OD1,OD2
TIB:LEU 159:N,CA,C,O,CB,CG,CD1,CD2
TIB:ARG 160:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
TIB:GLY 161:N,CA,C,O
TIB:ASN 162:N,CA,C,O,CB,CG,OD1,ND2
TIB:GLY 163:N,CA,C,O
TIB:TYR 164:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ,OH
TIB:ASP 165:N,CA,C,O,CB,CG,OD1,OD2
TIB:ILE 166:N,CA,C,O,CB,CG1,CG2,CD1
TIB:ASP 167:N,CA,C,O,CB,CG,OD1,OD2
TIB:VAL 168:N,CA,C,O,CB,CG1,CG2
TIB:PHE 169:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:GLY 177:N,CA,C,O
TIB:ASN 178:N,CA,C,O,CB,CG,OD1,ND2
TIB:ARG 179:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
TIB:ALA 180:N,CA,C,O,CB
TIB:PHE 181:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:ALA 182:N,CA,C,O,CB
TIB:GLU 183:N,CA,C,O,CB,CG,CD,OE1,OE2
TIB:PHE 184:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:LEU 185:N,CA,C,O,CB,CG,CD1,CD2
TIB:VAL 187:N,CA,C,O,CB,CG1,CG2
TIB:THR 189:N,CA,C,O,CB,OG1,CG2
TIB:GLY 190:N,CA,C,O
TIB:GLY 191:N,CA,C,O
TIB:PRO 207:N,CA,CD,C,O,CB,CG
TIB:PRO 208:N,CA,CD,C,O,CB,CG
TIB:ARG 209:N,CA,C,O,CB,CG,CD,NE,CZ,NH1,NH2
TIB:GLU 210:N,CA,C,O,CB,CG,CD,OE1,OE2
TIB:PHE 211:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:GLY 212:N,CA,C,O
TIB:SER 214:N,CA,C,O,CB,OG
TIB:HIS 215:N,CA,C,O,CB,CG,ND1,CD2,CE1,NE2
TIB:SER 216:N,CA,C,O,CB,OG
TIB:GLY 225:N,CA,C,O
TIB:LEU 227:N,CA,C,O,CB,CG,CD1,CD2
TIB:VAL 228:N,CA,C,O,CB,CG1,CG2
TIB:PRO 229:N,CA,CD,C,O,CB,CG
TIB:ILE 241:N,CA,C,O,CB,CG1,CG2,CD1
TIB:ASP 242:N,CA,C,O,CB,CG,OD1,OD2
TIB:ALA 243:N,CA,C,O,CB
TIB:THR 244:N,CA,C,O,CB,OG1,CG2
TIB:PRO 250:N,CA,CD,C,O,CB,CG
TIB:PHE 262:N,CA,C,O,CB,CG,CD1,CD2,CE1,CE2,CZ
TIB:CYS 268:N,CA,C,O,CB,SG





Subset SUB5B:
sub5mole.list
Subset SUB5B:
TIB:3-4,6-7,10-12,15,22-23,25-30,35,37,39,41-42,44-47,51-55,67,69-70,
TIB:72,74-75,94-99,108-112,114-115,118,122-126,128-131,135,137-138,
TIB:186,188,192-195,213,217-219,223-224,230-231,234-235,238-240,
TIB:245,269

subSbatom. list 
Subset SUB5B : 

TIB: SER 3 : N, CA, C, O, CB,OG 
TIB: GLN 4 :N,CA, C,0, CB, CG,CD,0E1,NE2 
55 TIB: LEU 6 :N, CA, C, O, CB , CG, CD1 , CD2 

TIB : PHE 7 : N , CA, C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 
TIB: PHE 10:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
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TIB 


:ASN 


11 


:N,CA,C,0,CB,CG,0D1,ND2 








TIB 


:LEU 


12 


:N,CA,C,0,CB,CG,CD1,CD2 








TIB 


: GLN 


15 


:N,CA,C,0,CB,CG,CD,0E1,NE2 








TIB 


:CYS 


22 


:N,CA,C,0,CB,SG 






5 


TIB 


:GLY 


23 


:N,CA,C,0 








TIB 


:ASN 


25 


:N,CA,C,O f CB,CG,0D1,ND2 








TIB 


:ASN 


26 


:N,CA,C,0,CB,CG,0Dl f ND2 








TIB 


:ASP 


27 


:N,CA,C,0,CB,CG,0D1,0D2 








TIB 


: ALA 


28 


:N,CA,C,0,CB 






10 


TIB 


:PRO 


29 


:N,CA,CD,C,0,CB,OG 








TIB 


: ALA 


30 


:N,CA,C,0,CB 








TIB 


: THR 


35 


:N,CA,C,0,CB,0G1,CG2 








TIB 


: THR 


37 


:N,CA,C,0,CB,0G1,CG2 








TIB 


:ASN 


39 


:N,CA,C,O,CB,CG,0Dl,ND2 






15 


TIB 


:CYS 


41 


:N,CA,C,0,CB,SG 








TIB 


:PRO 


42 


;N,CA,CD,C,0,CB,CG 








TIB 


: VAL 


44 


:N,CA,C,07CB,CGr,CG2 








TIB 


:GLU 


45: 


N,CA,C,O,CB,0G,CD,0El,OE2 








TIB 


:LYS 


46 


N,CA,C,0,CB,CG,CD,OE,NZ 






20 


TIB 


: ALA 


47 


N,CA,C,0,CB 








TIB 


:PHE 


51 


N,CA,C,0,CB,CG,CD1,CD2,CE1, 


CE2, 


CZ 




TIB 


:LEU 


52, 


N,CA,C,0,CB,CG,CD1,CD2 








TIB 


: TYR 


53, 


N,CA,C,0,CB,CG,CD1,CD2,CE1, 


CE2, 


CZ,OH 




TIB. 


SER 


54: 


N,CA,C,O,0B,0G 






25 


TIB: 


PHE 


55: 


N,CA,C,0,CB,CG,CD1,CD2,CE1, 


CE2, 


CZ 




TIB: 


LEU 


67: 


N,CA,C,0,CB,CG,CD1,CD2 








TIB: 


LEU 


69: 


N,CA,C,0,CB,CG,CD1,CD2 








TIB: 


ASP 


70: 


N,CA, 0,0,06,06,001,002 








TIB: 


THR 


72: 


N,CA,C,0,CB,0G1,CG2 






30 


TIB: 


LYS 


74: 


N,CA,C,0,CB,CG,CD,CE,NZ 








TIB: 


LEU 


75: 


N,CA,C,0,CB,CG,CD1,CD2 








TIB: 


ASN 


94: 


N,CA, C,0,CB,CG,0D1,ND2 








TIB: 


PHE 


95: 


N,CA,C,O,CB,CG,C01,CD2,CEl, 


CE2, 


CZ 




TIB: 


ASP 


96: 


N,CA,C,0,CB,CG,0D1,0D2 






35 


TIB: 


LEU 


97: 


N,CA,C,0,CB,CG,CD1,CD2 








TIB: 


LYS 


98: 


N,CA,C,0,CB,CG,CD,CE,NZ 








TIB: 


GLU 


99: 


N,CA,C,0,CB,CG,CD,OEl,OE2 








TIB: 


ARG 


108:N,CA,C,O,CB,CG,CD,NE,CZ,NHl,NH2 




TIB: 


GLY 


109:N,CA,C,0 






40 


TIB: 


HIS 


110:N,CA,C,O,CB / CG,NDl / CD2,CEl 


, NE2 






TIB: 


ASP 


111:N,CA,C,0,CB,CG,0D1,0D2 








TIB: 


GLY 


112:N,CA,C,0 








TIB: 


THR 


114:N,CA,C,0,CB,0G1,CG2 








TIB: 


SER 


115:N,CA,C,0,CB,0G 






45 


TIB: 


ARG 


118:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 




TIB: 


ASP 


122 :N, OA, 0,0, CB, CG,0D1,0D2 








TIB: 


THR 


123:N,CA,C,0,CB,0G1,CG2 








TIB: 


LEU 


124 :N,CA, 0,0,08,00,001,002 








TIB: 


ARG 


125:N,CA,C,O,CB,CG,CD,NE,0Z,NHl,NH2 


50 


TIB: 


GLN 


12 6 : N , OA , C , 0 , CB , CG , CD , 0E1 , NE2 








TIB: 


VAL 


128:N,CA,C,0,CB,CG1,CG2 








TIB: 


GLU 


129:N,CA,C,0,CB,CG p CD,OEl,OE2 








TIB: 


ASP 


130:N,CA,C,0,CB,CG,OD1,OD2 








TIB: 


ALA 


131 :N, CA, C, 0, CB 






55 


TIB: 


HIS 


135:N,CA,C,0,CB,CG,ND1,CD2,CE1 


,NE2 






TIB: 


ASP 


137:N,CA,C,0,CB,CG,ODl,OD2 








TIB: 


TYR 


138:N,CA,C,0,CB,CG,CD1,CD2,CE1 


,CE2 


,CZ,0H 
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TIB: THR 186:N,CA,C,0,CB,0G1,CG2 

TIB: GLN 188:N,CA,C,0,CB,CG,CD / 0E1,NE2 

TIB : THR 192 :N,CA, C,0,CB,OGl,CG2 

TIB: LEU 193 :N, OA, C, O, CB, CG, CD1, CD2 
5 TIB : TYR 194:N,CA, C,0,CB,CG,CD1,CD2 , CE1, CE2 ,CZ,OH 

TIB: ARG 195:N,CA,C,0,CB,CG,CD,NE,C2,NH1,NH2 

TIB: TYR 213:N,CA,C,0 / CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

TIB : SER 217 :N,CA, C,0,CB,OG 

TIB:PRO 218:N,CA,CD,C,0,CB,CG 
10 TIB : GLU 219:N,CA,C,0,CB,CG,CD,OE1,OE2 

TIB:LYS 223:N,CA,C,0,CB,CG,CD,CE,NZ 

TIB: SER 224:N,CA,C,0,CB,OG 

TIB:VAL 230:N,CA,C,O,CB,CGl,CG2 

TIB: THR 231:N,CA,C,0,CB,0G1,CG2 
15 TIB:ASP 234 :N,CA,C,0,CB,CG,0D1,0D2 

TIB: ILE 235:N,0A,C,O, CB , CGI , CG2 , CD1 

TIB: ILK 238 :N,CA,C,0, CB , CGI , CG2 , CB1 

TIB : GLU 239:N,CA,C,0,CB,CG,CD,0E1,0E2 

TIB : GLY 240:N,CA,C,O 
20 TIB: GLY 245:N,0A,C,O 

TIB: LEU 2 6 9 : N , OA , C , O , CB , OXT , CG , CD1 , CD2 
Subset ACTSITE:
actsitemole.list
Subset ACTSITE:
TIB:17,21,80-87,89-90,113,143-153,170-176,196-206,221-222,226,246-249,
TIB:251-261,263-267
actsiteatom. list 
Subset ACTSITE: 
30 TIB : SER 17:N,CA,C,0,CB,0G 

TIB : TYR 21 :N, OA, C,0, CB, CG, CD1,CD2 , CE1 , CE2 , CZ ,0H 

TIB : PHE 80:N / CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

TIB: ARG 81:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

TIB: GLY 82:N,CA,C,0 
35 TIB : SER 83 :N,CA, C,0,CB,OG 

TIB: ARG 84:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

TIB: SER 85:N,CA,C,0,CB,OG 

TIB: ILE 86:N,CA,C,0,CB,CG1,CG2,CD1 

TIB: GLU 87 :N,CA,C,0, CB,CG, CD,0E1,0E2 
40 TIB:TRP 89 :N, CA, C, O, CB, CG, CD1 , CD2 ,NE1 , CE2 , CE3 , CZ2 , CZ3 , CH2 

TIB: ILE 90:N,CA,C,0,CB,CG1,CG2,CD1 

TIB: PHE 113:N,CA,C,O,CB,CG,CDl,CD2,CEl,CE2,0Z 

TIB: THR 143 :N, CA, 0,0, CB, 0G1 , CG2 

TIB : GLY 144:N,CA,C,0 
45 TIB:HIS 145:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

TIB: SER 146 :N, CA, C, O, CB, OG 

TIB: LEU 147:N,CA,C,0,CB,CG,CD1,CD2 

TIB : GLY 148:N,CA,C,0 

TIB : GLY 149:N,CA,C,0 
50 TIB: ALA 150 :N, CA, C, O, CB 

TIB: LEU 151 : N, CA, C, O, CB, CG, CD1 , CD2 

TIB: ALA 152 :N, CA, C, O, CB 

TIB : THR 153 : N, CA, 0,0, CB, 0G1, CG2 

TIB: SER 170 :N, CA, 0,0, CB, OG 
55 TIB : TYR 171:N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

TIB: GLY 172:N,CA,C,0 

TIB: ALA 173 : N, CA, C, O, CB 
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TIB:PRO 174:N,CA,CD,C,0,CB,CG 

TIB: ARG 175 :N,CA,C,0,CB, CG,CD,NE, CZ,NH1,NH2 

TIB : VAL 176:N,CA,C,0,CB, CGI , CG2 

TIB: ILE 196:N,CA,C,0,CB,CG1,CG2,CD1 
5 TIB : THR 197 :N,CA,C,0 / CB, OG1 , CG2 

TIB: HIS 198:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

TIB : THR 199:N,CA,C,0,CB,0G1,CG2 

TIB: ASN 200^^,0,0,08,00,001^02 

TIB: ASP 201:N,CA,C,0,CB,CG,OD1,OD2 
10 TIB : ILE 202 :N, OA, 0,0, CB, CGI, CG2 , CD1 

TIB: VAL 203 :N,CA,C,0,CB,CG1, CG2 

TIB:PR0 204:N,CA,CD,C,0,CB,CG 

TIB: ARG 205 :N, CA,C,0, CB, CG,CD / NE,CZ,NH1,NH2 

TIB: LEU 206 :N, CA,C,0, CB, CG, CD1,CD2 
15 TIB : TRP 

221:J^,CA,C,0,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,C23,CH2 

TIB : ILE 2 22 : N , OA , C , O , CB, CGI , CG2 , 001 

TIB : THR 226:N,CA,C,0,CB,0G1,CG2 

TIB : GLY 246:N,0A,C,O 
20 TIB: ASN 247:N,CA,C,O,CB,0G,ODl,ND2 

TIB: ASN 248:N,CA,C,0,CB,CG,0D1,ND2 

TIB: GLN 249:N,CA,C,0,CB / CG,CD,OEl,NE2 

TIB: ASN 2 51 :N, CA,C,0,CB,CG,0D1,ND2 

TIB : ILE 252:N,CA,C,0,CB,CG1, CG2 , CD1 
25 TIB:PR0 253 :N,CA,CD, C,0, CB,CG 

TIB: ASP 254:N,CA,C,0,CB,CG,ODl,OD2 

TIB: ILE 255:N,CA,C,0,CB,CG1,CG2,CD1 

TIB:PR0 256:N,CA,CD,C,0,CB,CG 

TIB: ALA 257 :N,CA,C,0,eB 
30 TIB:HIS 258 :N,CA,C,0,CB,CG,ND1,CD2 ,CE1,NE2 

TIB: LEU 259:N,CA,C,O,CB,CG,0Dl,CD2 

TIB: TRP 

260:N,CA,C,0,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2 

TIB:TYR 261:N,CA,C,O,CB,0G,CDl,CD2 ,CE1,CE2 ,CZ,0H 
35 TIB: GLY 263:N,CA,C,0 

TIB: LEU 264 :N,CA, C,0,CB,CG,CD1,CD2 

TIB: ILE 2 65:N,CA,C,O,CB,CGl,CG2,0Dl 

TIB: GLY 266:N,CA,C,0 

TIB : THR 267 :N , CA, 0,0, CB, 0G1, CG2 
Subset RESTX:
restxmole.list
Subset RESTX:
NEWMODEL:14,16,18-20,31-34,36,38,40,48-50,56-66,68,78-79,88,91-93,
NEWMODEL:104-106,120,136,225,227-229,250,262,268
restxatom. list 
Subset RESTX: 

NEWMODEL : ALA 14 :N, CA,C,0, CB 

NEWMODEL : TYR 16 : N, CA, C,0, CB, CG, CD1, CD2 , CE1, CE2 , CZ ,0H 
50 NEWMODEL : ALA 18 : N, CA, C,0, CB 

NEWMODEL : ALA 19 :N, CA,C,0,CB 

NEWMODEL : ALA 20 :N, CA, 0,0, CB 

NEWMODEL : GLY 31:N,CA,C f O 

NEWMODEL : THR 32 :N, CA, C, 0,CB f 0G1,CG2 
55 NEWMODEL : ASN 33 :N, CA,C,0,CB,CG,0D1,ND2 

NEWMODEL: ILE 34 :N,0A,C,O,CB,CGl,CG2 ,CD1 

NEWMODEL: CYS 36 :N , CA, C, O, CB, SG 
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NEWMODEL 


: GLY 


38 


;N,CA,C,0 






NEWMODEL 


: ALA 


40: 


;N,CA,C,0,CB 






NEWMODEL 


;ASP 


48 


:N,CA,C,0,CB,CG,ODl,OD2 






NEWMODEL 


: ALA 


49: 


:N,CA,C,0,CB 




5 


NEWMODEL 


:THR 


50 


:N,CA,C,0,CB,0G1,CG2 






NEWMODEL 


:GLU 


56: 


:N,CA,C,0,CB,CG,CD, OEl,OE2 






NEWMODEL 


:ASP 


57: 


;N,CA,C,O,0B,CG,0Dl,0D2 






NEWMODEL 


: SER 


58: 


:N,CA,C,0,CB,OG 






NEWMODEL 


: GLY 


59. 


:N,CA,C,0 




10 


NEWMODEL 


:VAL 


60: 


: N , OA , C , O , CB , CG 1 , CG2 






NEWMODEL 


:GLY 


61: 


:N,CA,C,0 






NEWMODEL, 


:ASP 


62' 


:N,CA,C,0,CB,CG,ODl,OD2 






NEWMODEL 


:VAL 


63: 


:N,CA,C,0,CB,CG1,CG2 






NEWMODEL 


: THR 


64: 


:N, CA,C,0,CB,0G1,CG2 




15 


NEWMODEL , 


GLY 


65: 


:N,CA,C,0 






NEWMODEL 


PHE 


66: 


N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2, 


CZ 




NEWMODEL 


ALA 


68: 


N,CA,C,0/CB 






NEWMODEL 


► LEU 


78: 


>N,CA,C,0,CB,CG,CD1,CD2 






NEWMODEL 


:SER 


79: 


N,CA,C,0,CB,OG 




20 


NEWMODEL 


:ASN 


88: 


N,CA,C,0,CB,CG,0D1,ND2 






NEWMODEL 


GLY 


91: 


N,CA,C,0 






NEWMODEL 


ASN 


92: 


N,CA,C,0,CB,CG,0D1,ND2 






NEWMODEL. 


■ LEU 


93: 


N,CA,C,0,CB,CG,CD1,CD2 






NEWMODEL' 


CYS 


104:N,CA,C,O,CB,SG 




25 


NEWMODEL: 


SER 


105:N, CA,C, 0,CB,OG 






NEWMODEL: 


GLY 


106:N,CA,C,0 






NEWMODEL: 


VAL 


120 :N,CA, 0,0,06,061,062 






NEWMODEL: 


.PRO 


136:N,CA,CD,C,0,CB,CG 






NEWMODEL: 


:GLY 


225:N,CA,C,0 




30 


NEWMODEL* 


.LEU 


227:N,CA,C,0,CB,CG,CD1,CD2 






NEWMODEL. 


'VAL 


228:N,CA,C,0,CB,CG1,CG2 






NEWMODEL: 


:PRO 


229:N,CA,CD,C,0,CB,CG 






NEWMODEL: 


.PRO 


250:N,CA,CD,C,O,CB,CG 






NEWMODEL: 


PHE 


262:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2 


,CZ 


35 


NEWMODEL: 


.CYS 


268:N,CA,C,0,CB,SG 
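For readers who want to work with the atom lists programmatically, each entry appears to follow the pattern `MODEL:RES number:atom,atom,...`. The sketch below parses one such entry under that assumption. The names `AtomEntry` and `parse_atom_line` are hypothetical and are not part of the modelling software used in the example.

```python
from typing import NamedTuple, Tuple


class AtomEntry(NamedTuple):
    model: str               # e.g. "TIB" or "NEWMODEL"
    residue: str             # three-letter residue name
    number: int              # residue number
    atoms: Tuple[str, ...]   # atom names in the order listed


def parse_atom_line(line: str) -> AtomEntry:
    """Parse an entry of the assumed form 'MODEL:RES number:atom,atom,...'."""
    model, residue_part, atom_part = (f.strip() for f in line.split(":", 2))
    name, number = residue_part.split()
    atoms = tuple(a.strip() for a in atom_part.split(",") if a.strip())
    return AtomEntry(model, name.upper(), int(number), atoms)


entry = parse_atom_line("NEWMODEL:CYS 268:N,CA,C,O,CB,SG")
print(entry.residue, entry.number, len(entry.atoms))
```

Grouping parsed entries by `number` gives a per-residue view that can be compared against the corresponding residue-number list of the same subset.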





Example 10 

Providing a lipase variant E87K+D254K
The Humicola lanuginosa lipase variant E87K+D254K was
constructed, expressed and purified as described in WO
92/05249.

Example 11

Lipase-S-PEG 15,000 conjugate
The lipase variant E87K+D254K-SPEG conjugate was prepared as
described in Example 7, except that the enzyme is the Humicola
lanuginosa lipase variant (E87K+D254K) described in Example 10
and the polymer is mPEG 15,000.



Example 12

Immunogenicity assessed as IgG1 of lipase variant (E87K+D254K) in
Balb/C mice

Balb/C mice were immunized by subcutaneous injection of:

i) 50 µl 0.9% (wt/vol) NaCl solution (control group, 8 mice)
(control),

ii) 50 µl 0.9% (wt/vol) NaCl solution containing 25 µg of protein
of a Humicola lanuginosa lipase variant (E87K+D254K) (group 1,
8 mice) (unmodified lipase variant),

iii) 50 µl 0.9% (wt/vol) NaCl solution containing a Humicola
lanuginosa lipase variant carrying the substitutions E87K+D254K and
coupled to an N-succinimidyl carbonate activated mPEG 15,000 (group
2, 8 mice) (lipase-SPEG15,000).

The amount of protein for each batch was measured by optical
density measurements. Blood samples (200 µl) were collected
from the eyes one week after the immunization, but before the
following immunization. Serum was obtained by blood clotting
and centrifugation.

The IgG1 response was determined by use of the Balb/C mice
IgG1 ELISA method as described above.

Results:

Five weekly immunizations were required to elicit a
detectable humoral response to the unmodified Humicola
lanuginosa variant. The antibody titers elicited by the
conjugate (i.e. lipase-SPEG15,000) ranged between 960 and 1920,
and were only 2 to 4x lower than the antibody titer of 3840
that was elicited by unmodified HL82-Lipolase (figure to the
left).

The results of the tests are shown in Figure 1.
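The "2 to 4x" figure quoted in the results follows directly from dividing the titer of the unmodified enzyme by the titers of the conjugate. A minimal sketch of that arithmetic, using only the three titers stated in the text (the function name is a hypothetical label):

```python
def fold_reduction(reference_titer: float, conjugate_titer: float) -> float:
    """How many times lower the conjugate titer is than the reference titer."""
    return reference_titer / conjugate_titer


UNMODIFIED_TITER = 3840          # unmodified lipase, from the results
CONJUGATE_TITERS = (960, 1920)   # lipase-SPEG conjugate, lower and upper bound

for titer in CONJUGATE_TITERS:
    print(titer, fold_reduction(UNMODIFIED_TITER, titer))
```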

As will be apparent to those skilled in the art, in the light
of the foregoing disclosure, many alterations and modifications
are possible in the practice of this invention without departing
from the spirit or scope thereof. Accordingly, the scope of the
invention is to be construed in accordance with the substance
defined by the following claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME: Novo Nordisk A/S
(B) STREET: Novo Alle
(C) CITY: Bagsvaerd
(E) COUNTRY: Denmark
(F) POSTAL CODE (ZIP): DK-2880
(G) TELEPHONE: +45 4444 8888
(H) TELEFAX: +45 4449 3256
(ii) TITLE OF INVENTION: A modified polypeptide
(iii) NUMBER OF SEQUENCES: 9
(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 840 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vi) ORIGINAL SOURCE:
(B) STRAIN: Bacillus sp. PD498, NCIMB No. 40484
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..840
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

TGG TCA CCG AAT GAC CCT TAC TAT TCT GCT TAC CAG TAT GGA CCA CAA 48
Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gln Tyr Gly Pro Gln
1 5 10 15

AAC ACC TCA ACC CCT GCT GCC TGG GAT GTA ACC CGT GGA AGC AGC ACT 96
Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr
20 25 30

CAA ACG GTG GCG GTC CTT GAT TCC GGA GTG GAT TAT AAC CAC CCT GAT 144
Gln Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp
35 40 45

CTT GCA AGA AAA GTA ATA AAA GGG TAC GAC TTT ATC GAC AGG GAC AAT 192
Leu Ala Arg Lys Val Ile Lys Gly Tyr Asp Phe Ile Asp Arg Asp Asn
50 55 60

AAC CCA ATG GAT CTT AAC GGA CAT GGT ACC CAT GTT GCC GGT ACT GTT 240
Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val
65 70 75 80

GCT GCT GAT ACG AAC AAT GGA ATT GGC GTA GCC GGT ATG GCA CCA GAT 288
Ala Ala Asp Thr Asn Asn Gly Ile Gly Val Ala Gly Met Ala Pro Asp
85 90 95

ACG AAG ATC CTT GCC GTA CGG GTC CTT GAT GCC AAT GGA AGT GGC TCA 336
Thr Lys Ile Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser
100 105 110

CTT GAC AGC ATT GCC TCA GGT ATC CGC TAT GCT GCT GAT CAA GGG GCA 384
Leu Asp Ser Ile Ala Ser Gly Ile Arg Tyr Ala Ala Asp Gln Gly Ala
115 120 125

AAG GTA CTC AAC CTC TCC CTT GGT TGC GAA TGC AAC TCC ACA ACT CTT 432
Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu
130 135 140

AAG AGT GCC GTC GAC TAT GCA TGG AAC AAA GGA GCT GTA GTC GTT GCT 480
Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala
145 150 155 160

GCT GCA GGG AAT GAC AAT GTA TCC CGT ACA TTC CAA CCA GCT TCT TAC 528
Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gln Pro Ala Ser Tyr
165 170 175

CCT AAT GCC ATT GCA GTA GGT GCC ATT GAC TCC AAT GAT CGA AAA GCA 576
Pro Asn Ala Ile Ala Val Gly Ala Ile Asp Ser Asn Asp Arg Lys Ala
180 185 190

TCA TTC TCC AAT TAC GGA ACG TGG GTG GAT GTC ACT GCT CCA GGT GTG 624
Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val
195 200 205

AAC ATA GCA TCA ACC GTT CCG AAT AAT GGC TAC TCC TAC ATG TCT GGT 672
Asn Ile Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly
210 215 220

ACG TCC ATG GCA TCC CCT CAC GTG GCC GGT TTG GCT GCT TTG TTG GCA 720
Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala
225 230 235 240

AGT CAA GGT AAG AAT AAC GTA CAA ATC CGC CAG GCC ATT GAG CAA ACC 768
Ser Gln Gly Lys Asn Asn Val Gln Ile Arg Gln Ala Ile Glu Gln Thr
245 250 255

GCC GAT AAG ATC TCT GGC ACT GGA ACA AAC TTC AAG TAT GGT AAA ATC 816
Ala Asp Lys Ile Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys Ile
260 265 270

AAC TCA AAC AAA GCT GTA AGA TAC 840
Asn Ser Asn Lys Ala Val Arg Tyr
275 280
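The pairing of codons and amino acids in SEQ ID NO: 1 can be checked with the standard genetic code; 840 bases correspond to 840 / 3 = 280 codons, matching the stated protein length. The sketch below is an illustration only: the helper name `translate` is hypothetical, and the seventh codon of the first row is taken as TAC (Tyr), an assumption chosen to match the amino acid printed beneath it.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into one-letter amino acids."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last rows of SEQ ID NO: 1 (codon 7 of the first row assumed TAC).
FIRST_ROW = "TGG TCA CCG AAT GAC CCT TAC TAT TCT GCT TAC CAG TAT GGA CCA CAA"
LAST_ROW = "AAC TCA AAC AAA GCT GTA AGA TAC"

print(translate(FIRST_ROW))
print(translate(LAST_ROW))
print(840 // 3)
```

The one-letter output can be compared position by position with the three-letter residues printed under each row of the listing.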

35 

(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 280 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

45 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
15 10 15 

Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
20 25 30 

Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



50 



Leu Ala Arg Lys Val He Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
55 50 55 60 

Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

60 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 
85 90 95 

Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
100 105 110 

65 

Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 



Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
70 130 135 140 



WO 98/35026 PCT/DK98/00046



Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

Pro Asn Ala lie Ala Val Gly Ala lie Asp Ser Asn Asp Arg Lys Ala 
180 185 190 

10 

Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 

Asn He Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
15 210 215 220 

Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

20 Ser Gin Glv Lvs Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
260 265 270 

25 

Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 269 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(vi) ORIGINAL SOURCE:
(B) STRAIN: Bacillus lentus
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

40 Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 

15 10 15 



45 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

55 Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 



60 



Ser Gly Ser Gly Ser Val Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
65 130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Gly Ser He Ser 
145 150 155 160 

70 Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NOt 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Arthromyces ramosus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Gln Gly Pro Gly Gly Gly Gly Gly Ser Val Thr Cys Pro Gly Gly Gln
1 5 10 15

Ser Thr Ser Asn Ser Gln Cys Cys Val Trp Phe Asp Val Leu Asp Asp
20 25 30

Leu Gln Thr Asn Phe Tyr Gln Gly Ser Lys Cys Glu Ser Pro Val Arg
35 40 45

Lys Ile Leu Arg Ile Val Phe His Asp Ala Ile Gly Phe Ser Pro Ala
50 55 60

Leu Thr Ala Ala Gly Gln Phe Gly Gly Gly Gly Ala Asp Gly Ser Ile
65 70 75 80

Ile Ala His Ser Asn Ile Glu Leu Ala Phe Pro Ala Asn Gly Gly Leu
85 90 95

Thr Asp Thr Ile Glu Ala Leu Arg Ala Val Gly Ile Asn His Gly Val
100 105 110

Ser Phe Gly Asp Leu Ile Gln Phe Ala Thr Ala Val Gly Met Ser Asn
115 120 125

Cys Pro Gly Ser Pro Arg Leu Glu Phe Leu Thr Gly Arg Ser Asn Ser
130 135 140

Ser Gln Pro Ser Pro Pro Ser Leu Ile Pro Gly Pro Gly Asn Thr Val
145 150 155 160

Thr Ala Ile Leu Asp Arg Met Gly Asp Ala Gly Phe Ser Pro Asp Glu
165 170 175

Val Val Asp Leu Leu Ala Ala His Ser Leu Ala Ser Gln Glu Gly Leu
180 185 190

Asn Ser Ala Ile Phe Arg Ser Pro Leu Asp Ser Thr Pro Gln Val Phe
195 200 205

Asp Thr Gln Phe Tyr Ile Glu Thr Leu Leu Lys Gly Thr Thr Gln Pro
210 215 220

Gly Pro Ser Leu Gly Phe Ala Glu Glu Leu Ser Pro Phe Pro Gly Glu
225 230 235 240

Phe Arg Met Arg Ser Asp Ala Leu Leu Ala Arg Asp Ser Arg Thr Ala
245 250 255

Cys Arg Trp Gln Ser Met Thr Ser Ser Asn Glu Val Met Gly Gln Arg
260 265 270

Tyr Arg Ala Ala Met Ala Lys Met Ser Val Leu Gly Phe Asp Arg Asn
275 280 285

Ala Leu Thr Asp Cys Ser Asp Val Ile Pro Ser Ala Val Ser Asn Asn
290 295 300

Ala Ala Pro Val Ile Pro Gly Gly Leu Thr Val Asp Asp Ile Glu Val
305 310 315 320

Ser Cys Pro Ser Glu Pro Phe Pro Glu Ile Ala Thr Ala Ser Gly Pro
325 330 335

Leu Pro Ser Leu Ala Pro Ala Pro
340

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(vi) ORIGINAL SOURCE:

(B) STRAIN: Humicola lanuginosa DSM 4109
(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1..66
(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 67..576
(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..876

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG AGG AGC TCC CTT GTG CTG TTC TTT GTC TCT GCG TGG ACG GCC TTG 48
Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu
-22 -20 -15 -10

GCC AGT CCT ATT CGT CGA GAG GTC TCG CAG GAT CTG TTT AAC CAG TTC 96
Ala Ser Pro Ile Arg Arg Glu Val Ser Gln Asp Leu Phe Asn Gln Phe
-5 1 5 10

AAT CTC TTT GCA CAG TAT TCT GCA GCC GCA TAC TGC GGA AAA AAC AAT 144
Asn Leu Phe Ala Gln Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn
15 20 25

GAT GCC CCA GCT GGT ACA AAC ATT ACG TGC ACG GGA AAT GCC TGC CCC 192
Asp Ala Pro Ala Gly Thr Asn Ile Thr Cys Thr Gly Asn Ala Cys Pro
30 35 40

GAG GTA GAG AAG GCG GAT GCA ACG TTT CTC TAC TCG TTT GAA GAC TCT 240
Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser
45 50 55

GGA GTG GGC GAT GTC ACC GGC TTC CTT GCT CTC GAC AAC ACG AAC AAA 288
Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys
60 65 70

TTG ATC GTC CTC TCT TTC CGT GGC TCT CGT TCC ATA GAG AAC TGG ATC 336
Leu Ile Val Leu Ser Phe Arg Gly Ser Arg Ser Ile Glu Asn Trp Ile
75 80 85 90

GGG AAT CTT AAC TTC GAC TTG AAA GAA ATA AAT GAC ATT TGC TCC GGC 384
Gly Asn Leu Asn Phe Asp Leu Lys Glu Ile Asn Asp Ile Cys Ser Gly
95 100 105

TGC AGG GGA CAT GAC GGC TTC ACT TCG TCC TGG AGG TCT GTA GCC GAT 432
Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp
110 115 120

ACG TTA AGG CAG AAG GTG GAG GAT GCT GTG AGG GAG CAT CCC GAC TAT 480
Thr Leu Arg Gln Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr
125 130 135

CGC GTG GTG TTT ACC GGA CAT AGC TTG GGT GGT GCA TTG GCA ACT GTT 528
Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val
140 145 150

GCC GGA GCA GAC CTG CGT GGA AAT GGG TAT GAT ATC GAC GTG TTT TCA 576
Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp Ile Asp Val Phe Ser
155 160 165 170



TAT GGC GCC CCC CGA GTC GGA AAC AGG GCT TTT GCA GAA TTC CTG ACC 624
Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr
175 180 185

GTA CAG ACC GGC GGA ACA CTC TAC CGC ATT ACC CAC ACC AAT GAT ATT 672
Val Gln Thr Gly Gly Thr Leu Tyr Arg Ile Thr His Thr Asn Asp Ile
190 195 200

GTC CCT AGA CTC CCG CCG CGC GAA TTC GGT TAC AGC CAT TCT AGC CCA 720
Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro
205 210 215

GAG TAC TGG ATC AAA TCT GGA ACC CTT GTC CCC GTC ACC CGA AAC GAT 768
Glu Tyr Trp Ile Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp
220 225 230

ATC GTG AAG ATA GAA GGC ATC GAT GCC ACC GGC GGC AAT AAC CAG CCT 816
Ile Val Lys Ile Glu Gly Ile Asp Ala Thr Gly Gly Asn Asn Gln Pro
235 240 245 250

AAC ATT CCG GAT ATC CCT GCG CAC CTA TGG TAC TTC GGG TTA ATT GGG 864
Asn Ile Pro Asp Ile Pro Ala His Leu Trp Tyr Phe Gly Leu Ile Gly
255 260 265

ACA TGT CTT TAG 876
Thr Cys Leu *
270

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu 
-22 -20 -15 -10 

Ala Ser Pro Ile Arg Arg Glu Val Ser Gln Asp Leu Phe Asn Gln Phe
-5 1 5 10

Asn Leu Phe Ala Gln Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn
15 20 25

Asp Ala Pro Ala Gly Thr Asn Ile Thr Cys Thr Gly Asn Ala Cys Pro
30 35 40

Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser
45 50 55

Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys
60 65 70

Leu Ile Val Leu Ser Phe Arg Gly Ser Arg Ser Ile Glu Asn Trp Ile
75 80 85 90

Gly Asn Leu Asn Phe Asp Leu Lys Glu Ile Asn Asp Ile Cys Ser Gly
95 100 105

Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp
110 115 120

Thr Leu Arg Gln Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr
125 130 135

Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val
140 145 150

Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp Ile Asp Val Phe Ser
155 160 165 170

Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr
175 180 185

Val Gln Thr Gly Gly Thr Leu Tyr Arg Ile Thr His Thr Asn Asp Ile
190 195 200

Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro
205 210 215

Glu Tyr Trp Ile Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp
220 225 230

Ile Val Lys Ile Glu Gly Ile Asp Ala Thr Gly Gly Asn Asn Gln Pro
235 240 245 250

Asn Ile Pro Asp Ile Pro Ala His Leu Trp Tyr Phe Gly Leu Ile Gly
255 260 265

Thr Cys Leu *
270

(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R28K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

gggatgtaac caagggaagc agcactcaaa cg 32

(2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "R62K oligo"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
cgactttatc gataaggaca ataaccc 27



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "R169K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

caatgtatcc aaaacgttcc aaccagc 27

Patent Claims

1. A polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the
polypeptide, having been modified in a manner to increase the
number of attachment groups on the surface of the polypeptide, in
comparison to the number of attachment groups available on the
corresponding parent polypeptide, and/or

b) one or more fewer polymeric molecules coupled to the
polypeptide, having been modified in a manner to decrease the
number of attachment groups at or close to the functional site(s)
of the polypeptide, in comparison to the number of attachment
groups available on the corresponding parent polypeptide.

2. The conjugate according to claim 1, having 1 to 25,
preferably 1 to 10, additional polymeric molecules coupled to the
surface of the polypeptide in comparison to the number of
polymeric molecules of a conjugate prepared from the corresponding
parent enzyme.

3. The conjugate according to claims 1 and 2, wherein the
additional attachment group(s) is (are) amino groups in the form of
Lysine residue(s), or carboxylic groups in the form of Aspartic
acid or Glutamic acid residues.

4. The conjugate according to any of claims 1 to 3, wherein
the additional attachment group(s) is (are) prepared by a
conservative substitution of an amino acid residue, such as an
Arginine to Lysine substitution.

5. The conjugate according to claims 1 to 3, wherein the
additional attachment group(s) is (are) prepared by a conservative
substitution of an amino acid, such as an Asparagine to
Aspartate/Glutamate or a Glutamine to Aspartate/Glutamate
substitution.

6. The conjugate according to any of claims 1 to 5, wherein
the added attachment group is located more than 5 Å, preferably 8
Å, especially 10 Å from the functional site.
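The distance criterion of claim 6 can be illustrated with a minimal sketch. It assumes that atomic coordinates in Å are available for the candidate attachment group and for the functional site; the function names and the choice of the minimum atom-to-atom distance as the measure are assumptions made here for illustration and are not prescribed by the claims.

```python
import math
from typing import Iterable, Tuple

Point = Tuple[float, float, float]  # x, y, z coordinates in angstroms


def min_distance(atom: Point, site_atoms: Iterable[Point]) -> float:
    """Smallest Euclidean distance from one atom to any functional-site atom."""
    return min(math.dist(atom, s) for s in site_atoms)


def far_enough(atom: Point, site_atoms: Iterable[Point], cutoff: float = 5.0) -> bool:
    """True if the atom lies more than `cutoff` angstroms from the site.

    The default of 5 follows claim 6; 8 or 10 are the preferred values.
    """
    return min_distance(atom, site_atoms) > cutoff


# Hypothetical coordinates, chosen only to make the arithmetic easy to check.
site = [(3.0, 4.0, 0.0), (0.0, 0.0, 12.0)]
print(min_distance((0.0, 0.0, 0.0), site))
print(far_enough((0.0, 0.0, 0.0), site, cutoff=8.0))
```

Passing `cutoff=8.0` or `cutoff=10.0` applies the preferred thresholds instead of the default.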

7. The conjugate according to claim 1, having 1 to 25,
preferably 1 to 10, fewer polymeric molecules coupled at or close
to the functional site of the polypeptide in comparison to the
number of polymeric molecules of a conjugate prepared on the basis
of the corresponding parent polypeptide.

8. The conjugate according to claim 7, wherein the removed
attachment group(s) is (are) amino groups in the form of Lysine
residue(s), or carboxylic groups in the form of Aspartic acid or
Glutamic acid residues.

9. The conjugate according to any of claims 7 and 8, wherein
the removed attachment group(s) is (are) prepared by a conservative
substitution of an amino group, such as a Lysine to Arginine
substitution.

10. The conjugate according to any of claims 7 to 8, wherein
the removed attachment group(s) is (are) prepared by a conservative
substitution of a carboxylic group, such as an Aspartate/Glutamate
to Asparagine or an Aspartate/Glutamate to Glutamine substitution.

11. The conjugate according to any of claims 1 to 10, wherein
the removed attachment group is located within 5 Å, preferably 8
Å, especially 10 Å from the functional site.

12. The conjugate according to any of claims 1 to 11, wherein
the attachment groups are broadly spread.

13. The conjugates according to claims 1 to 12, wherein the
parent polypeptide moiety of the conjugate has a molecular weight
from 1 to 100 kDa, preferably 15 to 100 kDa.

14. The conjugate according to claim 13, wherein the parent 
polypeptide moiety of the conjugate has a molecular weight of from 
1 to 35 kDa. 

15. The conjugates according to claim 14, wherein the parent
polypeptide is an enzyme selected from the group of
Oxidoreductases, including laccases and Superoxide dismutase
(SOD); Hydrolases, including proteases, especially subtilisins,
and lipolytic enzymes; Transferases, including Transglutaminases
(TGases); Isomerases, including Protein disulfide Isomerases
(PDI).

16. The conjugate according to claim 15, wherein the parent 
enzyme is PD498, Savinase®, BPN', Proteinase K, Proteinase R,
Subtilisin DY, Lion Y, Rennilase®, JA16, Alcalase® or a Humicola
lanuginosa lipase, such as Lipolase®.

17. The conjugate according to claim 16, wherein the enzyme
moiety of the conjugate is a PD498 variant with one or more of the
following substitutions: R51K, R62K, R121K, R169K, R250K, R28K,
R190K, P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K,
G87K, I88K, N209K, A211K, N216K, N217K, G218K, Y219K, S220K,
Y221K, G262K.

18. The conjugate according to claim 17, with one of the
following mutations: R28K+R62K, R28K+R169K, R62K+R169K,
R28K+R69K+R169K.
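The substitution notation used in these claims (parent residue, position, new residue, with `+` joining combined mutations) can be read mechanically. The following is a minimal sketch assuming one-letter amino acid codes and 1-based numbering of the mature polypeptide; the helper names `parse_variant` and `apply_variant` are hypothetical and not part of the disclosure.

```python
import re
from typing import List, NamedTuple


class Substitution(NamedTuple):
    original: str     # one-letter code of the residue in the parent
    position: int     # residue number in the parent polypeptide
    replacement: str  # one-letter code of the introduced residue


_TOKEN = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_variant(text: str) -> List[Substitution]:
    """Parse a string such as 'R28K+R62K' into individual substitutions."""
    subs = []
    for token in text.split("+"):
        match = _TOKEN.match(token.strip())
        if match is None:
            raise ValueError(f"not a substitution: {token!r}")
        subs.append(Substitution(match.group(1), int(match.group(2)), match.group(3)))
    return subs


def apply_variant(sequence: str, subs: List[Substitution], first: int = 1) -> str:
    """Apply substitutions to a one-letter sequence whose first residue is numbered `first`."""
    residues = list(sequence)
    for sub in subs:
        index = sub.position - first
        if not 0 <= index < len(residues):
            raise ValueError(f"position {sub.position} is outside the sequence")
        if residues[index] != sub.original:
            raise ValueError(f"expected {sub.original} at {sub.position}, found {residues[index]}")
        residues[index] = sub.replacement
    return "".join(residues)


# Toy sequence for illustration only; it is not one of the listed enzymes.
print(apply_variant("ARNDR", parse_variant("R2K+R5K")))
```

Checking the parent residue before replacing it catches numbering mismatches, for example when a sequence still carries its signal peptide.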

19. The conjugate according to claim 16, wherein the enzyme
moiety of the conjugate is a Savinase® variant with one or more of
the following substitutions: R10K, R19K, R45K, R145K, R170K,
R186K, R247K, K94R, P5K, P14K, T22K, T38K, H39K, P40K, L42K,
L75K, N76K, L82K, P86K, S103K, V104K, S105K, A108K, A133K,
T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, G211K, S212K,
T213K, A215K, S216K, N269K.

20. The conjugate according to claim 16, wherein the enzyme
moiety of the conjugate is a Humicola lanuginosa lipase variant
with one or more of the following substitutions:
R133K, R139K, R160K, R179K, R209K, R118K, R125K, A18K, G31K, T32K,
N33K, G38K, A40K, D48K, T50K, E56K, D57K, S58K, G59K, V60K, G61K,
D62K, T64K, L78K, E87K, N88K, G91K, N92K, L93K, S105K, G106K,
V120K, P136K, G225K, L227K, V228K, P229K, P250K, D254K, F262K.

21. The conjugate according to claim 20 with the following
mutations: E87K+D254K.

22. The conjugate according to any of claims 1 to 21, wherein
the polymeric molecules coupled to the polypeptide have a
molecular weight from 1 to 60 kDa, especially 1 to 35 kDa, especially
3 to 25 kDa.

23. The conjugate according to claim 22, wherein the polymeric
molecule is selected from the group comprising natural or
synthetic homo- and heteropolymers, selected from the group of the
synthetic polymeric molecules including Branched PEGs, poly-vinyl
alcohol (PVA), poly-carboxyl acids, poly-(vinylpyrrolidone) and
poly-D,L-amino acids, or naturally occurring polymeric molecules
including dextrans, including carboxymethyl-dextrans, and
celluloses such as methylcellulose, carboxymethylcellulose,
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, and
hydrolysates of chitosan, starches, such as hydroxyethyl-starches,
hydroxypropyl-starches, glycogen, agarose, guar gum, inulin,
pullulans, xanthan gums, carrageenin, pectin and alginic acid.

24. A method for preparing improved polypeptide-polymer
conjugates comprising the steps of:

a) identifying amino acid residues located on the surface of the
3D structure of the parent polypeptide in question,

b) selecting target amino acid residues on the surface of said 3D
structure of said parent polypeptide to be mutated,

c) i) substituting or inserting one or more amino acid residues
selected in step b) with an amino acid residue having a suitable
attachment group, and/or

ii) substituting or deleting one or more amino acid residues
selected in step b) at or close to the functional site,

d) coupling polymeric molecules to the mutated polypeptide.
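Steps a) to c) of the method can be sketched in code as a selection over annotated residues. The sketch below is illustrative only: the `Residue` record, the relative-accessibility cutoff of 0.25, and the example data are assumptions introduced here, while the distance thresholds follow the values named in the claims. Step d), the chemical coupling, is not modelled.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Residue:
    code: str             # one-letter amino acid code
    position: int         # numbering in the parent polypeptide
    accessibility: float  # relative solvent accessibility, 0..1 (assumed metric)
    site_distance: float  # distance to the functional site in angstroms


def surface_residues(residues: List[Residue], min_accessibility: float = 0.25) -> List[Residue]:
    """Step a): keep residues exposed on the surface of the 3D structure."""
    return [r for r in residues if r.accessibility >= min_accessibility]


def additions(surface: List[Residue], min_site_distance: float = 10.0) -> List[str]:
    """Steps b) and c) i): surface Arg far from the site is replaced by Lys."""
    return [f"R{r.position}K" for r in surface
            if r.code == "R" and r.site_distance > min_site_distance]


def removals(surface: List[Residue], max_site_distance: float = 5.0) -> List[str]:
    """Steps b) and c) ii): surface Lys at or close to the site is replaced by Arg."""
    return [f"K{r.position}R" for r in surface
            if r.code == "K" and r.site_distance <= max_site_distance]


# Invented example data, not taken from any real structure.
toy = [
    Residue("R", 12, 0.60, 15.0),  # exposed and far from the site
    Residue("R", 40, 0.10, 20.0),  # buried
    Residue("R", 77, 0.50, 4.0),   # exposed but too close to the site
    Residue("K", 90, 0.70, 3.0),   # exposed Lys at the site
    Residue("A", 5, 0.90, 30.0),   # exposed, not a target residue type
]
surface = surface_residues(toy)
print(additions(surface) + removals(surface))
```

Keeping the surface filter separate from the two selection functions mirrors the order of steps a) and b), so the cutoffs can be varied independently.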

25. The method according to claim 24, wherein the
identification of amino acid residues located on the surface of
the polypeptide referred to in step a) is performed by a computer
program analyzing the 3D structure of the parent polypeptide in
question.

26. The method according to claim 24, wherein step b) 
comprises selecting Arginine or Lysine residues on the surface of 
the parent polypeptide. 

27. The method according to claim 24, wherein one or more
Arginine residues identified in step b) is (are) substituted with a
Lysine residue(s) in step c).

28. The method according to claim 27, wherein the
substituted Arginine residues have a distance of more than 5 Å,
preferably 8 Å, especially 10 Å from the functional site.

29. The method according to any of claims 24 to 28, wherein
the polypeptide prepared in step c) is coupled to polymeric
molecules.

30. Use of the conjugate in claims 1 to 23 for reducing the
allergenicity of industrial products.

31. Use of the conjugate in claims 1 to 23 for reducing the 
immunogenicity of pharmaceuticals. 

32. A composition comprising a conjugate of any of claims 1
to 23 and further comprising ingredients used in industrial
products.

33. The composition according to claim 32, wherein the
industrial product is a detergent, such as a laundry, dish wash or
hard surface cleaning product, or a food or feed product.

34. The composition according to claim 32, comprising a
conjugate of any of claims 1 to 22 and further ingredients used in
skin care products.

35. A composition comprising a conjugate of any of claims 1
to 23 and further comprising ingredients used in pharmaceuticals.